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IN THE U.S. PATENT AND TRADEMARK OFFICE 



PENTTILA, Merja et al . Conf.: 
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Group : 
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PROCESS FOR PARTITIONING OF PROTEINS 



AMENDMENT 



BOX PATENT APPLICATION 

Assistant Commissioner for Patents 
Washington, DC 20231 



September 18, 2001 



Sir: 



The following preliminary Amendments and Remarks 



are 



respectfully submitted in connection with the above-identified 
application. 



AMENDMENTS 

Please amend the specification as follows: 

Before line 1, insert 

This application is the national phase under 35 U.S.C. 
§371 of PCT International Application No. PCT/FIOO/0024 9 which 
has an International filing date of March 24, 2000, which 

designated the United States of America and was published in 
English. — 



Appl . No. New 



REMARKS 

The specification has been amended to provide a cross- 
reference to the previously filed International Application. 

A Sequence Listing is part of the present application; the 
Sequence Listing is provided in Computer Readable Form on an 
attached diskette. Applicant submits that the content of the CRF 
file 36838SEQ.TXT is identical to the printed copy of the Sequence 
Listing attached hereto. No new matter is added by the Sequence 
Listing. 

Should there be any outstanding matters that need to be 
resolved in the present application^ the Examiner is respectfully 
requested to contact the undersigned below, to conduct an interview 
in an effort to expedite prosecution in connection with the present 
application . 

If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional fees 



Appl . No. New 



required under 37 C.F.R. §§ 1.16 or 1.17; particularly, extension of 



time fees. 



Respectfully submitted^ 



BIRCH, STEWART, KOLASCH & BIRCH, LLP 



By r^J/J.^^^f ^3i, i 

Gera^ M. Murphy, Jr., #28, 977 
P.O. Box 747 

GMM/jeb Falls Church, VA 22040-0747 

1163-0355P (703) 205-8000 
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MARKINGS TO SHOW CHANGES MADE 




In the Specification : 

The specification was amended to provide cross-referencing 

to the International Application, 
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5 The present invention relates to isolation and purification of proteins in aqueous two-phase 
systems (ATPS). Specifically the invention provides processes for partitioning of molecules 
of interest in ATPS by fusing said molecules to targeting proteins which have the ability of 
carrying said molecule into one of the phases. 



1 0 Background of the invention 



Liquid-liquid extraction in an aqueous two-phase system (ATPS) can offer a powerful 
technique for isolation and purification of proteins. The separation of macromolecules and 
particles by means of liquid-liquid extraction is well known (Albertsson, 1 986 ; Walter et al , 
15 1 985; Kula, 1 990). Mmnly polyethylene glycol (PEG) - salt, PEG-dextran and PEG-starch 
systems have been in use. More recently detergents and detergents with reversed solubility 
were discovered as suitable methods for separation of macromolecules, and especially for 
the separation of proteins. 



20 An advantage of aqueous two-phase systems (ATPS) is that they are especially suited for 
large scale processing of microbial proteins not only from culture supernatants but also from 
crude extracts containing cells and cell debris (Kula, 1979; Kula, 1985). Characteristic 
features of biological fluids as well as suspensions are rather small particle sizes, low density 
differences between fluid and suspended sohds, high viscosities of the extracts and high 

25 compressibility of the solids (Hustedt ei al, 1985; Bender and Koglin, 1986). These 
attributes decrease the performance of conventional methods for soiid-iiquid separation like 
centrifugation and filtration at the beginning of a protein recovery process. Using an 
aqueous two-phase system removal of solids can be integrated into a liquid-liquid separation 



30 



step, clarification is thus combined with an initial purification (Kula, 1979; Kula, 1985) 



After the extraction process phase separation can be accomplished by settling under gravity 
as well as by centrifugation (Kula, 1 985). ATPS can be applied in various scales from very 
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small laboratory scale to large iadustrial scale thus suiting for various proteins, purposes and 
needs. With regard to industrial purposes commercially available centrifugal separators can 
be used to shorten separation time. Several authors have investigated the potential of 
centrifugal separators of various design for processing of large volumes of aqueous two 
phase systems (Kula, 1979; Kula et al, 1981, Kula et al, 1982; Kula, 1985). In these 
studies the authors have used polymer/polymer or polymer/salt systems and the results of 
these investigations demonstrate the feasibility of continuous separation of aqueous two- 
phase systems in centrifugal separators. 



Extraction systems based on nonionic surfactants have been described as an alternative to 
standard polymer/polymer or polymer/salt systems. Phase forming surfactants are e.g. 
polyoxyethylene type nonionic detergents. The basis of this type of aqueous two-phase 
system is the temperature-dependent reversible hydration of the polar ethylene oxide head 
groups. The temperature at which the phase separation occurs is referred to as the cloud- 
point (cloud-point extraction). This kind of aqueous two-phase system is especially suited 
for the extraction of amphiphilic biomolecules. The potential of this type of two-phase 
system for separating membrane bound protems from cytosolic and peripheral membrane 
protems was first demonstrated by Bordier (1981). Heusch and Kopp (1988) have been able 
to demonstrate that lamellar structures formed in the miscibility gaps of polyglycol ether / 
water systems are responsible for the selective extraction of hydrophobic substances. 

Recently, the successful application of a surfactant-based aqueous two-phase system for the 
extraction of a membrane bound protein (cholesterol oxidase) from the unclarified culture 
medium of the gram-positive microorganism Nocardia rhodochrous on a bench scale has 
been reported (Minuth et aL, 1 995). By addition of only one chemical compound a product 

release through solubilization was possible in homogeneous phase and in a second step a 
clarification as well as an initial purification was achieved by an extraction process at 
elevated temperatures separating the detergent rich phase. A closed concept was further 
developed for the production of the membrane bound enzyme by surfactant-based 
extraction, organic solvent extraction and anion-exchange chromatography, which gave a 
product suitable for analytical apphcations (Minuth et aL, 1996). 
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In aqueous two phase systems the desired target e.g. a protein should partition selectively 
into one phase (preferentially the lighter phase) while the other substances should partition 
into the other phase (preferentially the heavier phase). In PEG/salt and PEG/dextran and 
similar systems there are several driving forces for a substance like charges, hydrophobic, 
hydrophilic forces or the dependence on conformation or ligand interaction (Albertsson, 
1986). The forces leading to separation in detergent based aqueous two phase systems are 
suggested to be primarily hydrophobic (Terstappen et aL, 1993). Even if a lot of work has 
been carried out in the field of prediction in ATPS, none of the designed models provides 
a physical picture of the phase behaviour and prediction is hardly possible (Johansson ei al. , 
1998). 

In ATPS the partitioning coefficient is defined as the concentration (activity in case of an 
enzyme) of the target in the top phase divided by the concentration (enzyme:activity) of the 
target protein in the bottom phase. Partitioning coefficients in ATPS systems are usually in 
the range from less than 1 up to less than 100 (Terstappen et al , 1 992; Terstappen et al , 



Yield: is defined as the amount of target in the top phase divided by the sum of the amount 
of target in top and bottom. This leads to the following equation 



1993). 





1+ 



* 



K 



If the desired substance is directed to the heavier phase (as it can be the case using Triton) 
the yield is defined by 
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The volume ratio of the two coexisting phases are defined by the volumes of the lighter over 
the heavier phase, respectively. 

5 An example of useful protems facing problems in purification in a cost-effective way are the 
commonly used industrial enzymes used as biocatalysts, the giycosyl hydrolases, proteases 
and lipases produced by fungi and bacteria. These are used in e.g. laundry, textile, paper and 
pulp, food and feed industry. The fact that microbes produce many different enzymes during 
their growth and the fact that some of these may be undesired in certain applications leads 

10 to a need to enrich the active component(s). This enrichment can be performed by choosing 
appropriate growth conditions, by genetic engineering and/or by down-stream processing 
(e.g. purification of the active component(s)). 



Purification of proteins are generally performed by chromatography. Usually gel- 
15 chromatographic methods are used based on ion-exchange, hydrophobic interaction, affinity 
chromatography and molecular sieving. Methods like electrophoresis and crystalUsation can 
also be used. These methods are well known in the art and suitable for proteins of fairly high 
market value. In case of bulky enzyme production these methods, however, are too 
expensive in order to keep the final product on a compatible price level Due to similar 
20 properties of these enzymes several purification steps are usually needed to separate the 
proteins from each other. This ofben causes low final yields and therefore a high loss of 
product. 

Many extracellular hydrolases produced by the filamentous fungus Trichoderma are 
25 currently used in different industrial applications in large scale. These hydrolases are e.g. 
hemicellulases (such as xylanases and mannanases), celluiases (such as endoglucanases and 
cellobiohydrolases) and proteases. Purification of these is well known in the art (Bhikhabhai 
et al, 1984; Fere etal, 1995), but for large industrial applications the purification methods 
are too expensive. Alternative methods to enrich these hydrolases have been used, including 
30 deletion of undesired genes by genetic engineering (Suominen et ah, 1 992). However, even 
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after extensive genetic engineering some minor undesired activities may still be present in 

the final product. 

ATPS have been studied in purification of cellulases of T. reesei and the purification of an 
endoglucanase III showed some promising results, enriching the yield of the protein in the 
upper phase (U.S. Pat. 5,139,943). ATPS have also been studied in purification of lipases, 
endoxylanase and natamycin (EP 0 574 050 Al). No K and Y values are, however, 
mentioned. 



As in other protein purification methods, similar properties of proteins produced by an 
organism impair also in ATPS, e.g. selective separation of one protein is not achieved 
optimally. To obtain selectivity in purification affinity chomatographic methods are used 
especially for analytic purposes and in purification of high-value products. These include 
immunoaffinity chromatography and various fiision protein strategies well described in the 
art such as fiasing the protein of interest to an other protein (e.g. glutathione-S-transferase), 
protein domain (e.g. protein A-ZZ domain) or small peptide (e.g. His-tag), which selectively 
bind to the solid carrier and thus the recovery of the fiision partner is obtained as well. The 
fijsion protein can be suitable ft)r the particular purpose as such or cleavage of the product 
fi'om the added fiision partner may be desired. There are well-known methods in the art on 
cleavage of fijsion proteins flrom their partners by proteases, e.g. by factor X, thrombin or 
papain or by genetically introducing a protease cleavage site (e.g. Kex2 site) or autoproses- 
sing domains (e.g. Intein, New England Biolabs) or by chemical cleavage (e.g. CNBr). 

ATPS offer advantages mainly with respect to technology compared with the solid state 
based separation systems e.g. affinity column-based techniques. The scale-up of extractive 
enzymes is relatively simple utilising commercially available equipment and machineiy 
conmion in the chemical industry. In addition, it can be used in a continuous process and it 
can be relatively cost extensive. It can be used as a single step for clarification, concentration 
and purification. ATPS can be used as a first capture-step, but for bulk products often no 
fiirther purification is needed. 
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To aid selective separation in two-phase systems, recent publications have described the 
fusion of small peptide tags of 12 amino acids to the protein to be purified. The most 

successful of these soluble peptides are contaming tryptophans. So far they have mainly been 
applied for very small molecules like the staphylococcal protein A derivative ZZTO 
(Berggrene^a/., 1999; Hassinen a/., 1994; Kohler a/., 1991). 



Use of ATPS has so far been limited to certain targets. Due to the advantages of ATPS in 

protein separation, purification and localisation, highly selective and powerful methods 
should be developed. This is especially important for large scale processes where ATPS in 
general is very inexpensive as a first capture step or as the only step for purification, 
clarification and concentration. The system should be universal so that the technique would 
be strong enough to mediate separation of in principle any component to the desired phase 
irrespective of its size or biochemical properties. 



Description of the invention 



In this invention we describe selective separation and partitioning of molecules and particles 
fusing them with targeting proteins having the capability to carry the molecule or particle 
of interest to the desired phase in ATPS, and to keep it in this phase if wanted. This 
invention is directed to make ATPS usable for every biotechnological product. By addition 
of the targeting protein to selective products, either by genetic tagging of proteins, by 
chemical binding, glueing or by use of any other technique, the product molecule can be 
turned more suitable for separation in ATPS. Using ATPS the product or certain component 
is therefore driven to one phase while the other components or by-products are directed to 
the other phase(s). 



We also describe that efficient separation in ATPS can be obtained using targeting proteins 
which are/can be larger than the described small soluble synthetic peptide tags of 12 amino 
acids or less. These targeting molecules can aid in separating of small molecules but even 
large proteins and particles. Unlike the small peptide tags, it is not necessary that they 
contain tryptophan residues, although they may do so. They can be hydrophobic or 
moderately hydrophobic and/or amphipathic in nature, either in monomeric form or when 
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forming aggregates. Such proteins can be found in nature or they can be designed, or 
obtained through for instance methods known in art for mutant generation, gene shuffling 
or directed evolution. Suitable targeting moiecules can be screened for instance by fusion 
the product of interest to a library of natural or mutant sequences, and screening the ability 
of the fusion molecules to separate in ATPS. Furthermore, any molecule capable of 
separating in ATPS is a suitable targeting molecule. 

Several techniques exploiting purified protein for isolation of the corresponding gene may 
be used to find genes encoding suitable targeting molecules for ATPS. Suitable proteins or 

polypeptides may be purified on the basis of their properties. They can be obtained by 
applying the ceils, cell extracts or culture media to ATPS and recovering the proteins or 
peptides separated into the phase containing the hydrophobic phase material. Suitable 
targeting molecules may also be recovered for example from the culture medium foam 
formed either during the cultivation of a microorganism or caused by bubbling gas through 
the medium. Proteins and peptides suitable as targeting molecules may further be recovered 
from aggregates caused by freezing of culture media. After the targeting molecules have 
been purified, the corresponding genes are isolated using techniques known to a person 
skilled in the art. Such techniques include for example screening of expression libraries using 
antibodies raised against purified polypeptide or peptide, and PCR cloning and screening of 
genomic and/or cDNA libraries with oligonucleotides designed on the basis of N-terminal 
or internal protein sequences. 

Examples of molecules suited as targeting proteins in ATPS found in nature are 
hydrophobin-like small proteins. Hydrophobins are secreted proteins with interesting 
physico-chemical properties that have recently been discoverd from filamentous fungi 
(Wessels, 1994; Wosten and Wessels, 1997; Kershaw and Talbot, 1998). They are usually 
small proteins, approximately 70 to 160 amino acids, containing eight cysteine residues in 
conserved pattern, and do usually not contain tryptophanes. However, also muitimoduiar 
proteins with one or several hydrophobin domains and e.g. proline-rich or asparagine/glycine 
repeats, or hydrophobins containing less than eight cysteine residues have been characterized 
(LoTa^eial, 1994; Lora era/., 1995; Arntz and Tudzynski, 1 997) . Hydrophobins have been 
divided into two classes based on their hydropathy profiles (Wessels, 1 994). 



wo 00/58342 



8 



PCT/FIOO/00249 



Today most protein data exists for the hydrophobins Sc3p of Schizophy Hum commune (class 
I), and cerato-uhnin of Ophiostoma ulmi and cryparin of Cryponectria parasitica (class II), 
although more than 30 gene sequences for hydrophobins have been published (Wosten and 
Wesseis, 1997). HFB genes are often naturally highly expressed but due to special 
requirements in cultivation conditions and the biochemical properties of the proteins, 
purification of HFBs in large amounts have been difficult. For instance only relatively 
moderate production levels of a few mg per liter of Sc3 hydrophobin in static cultures are 
obtained (Han Wosten, personal communication). Published purification procedures include 
e.g. multi-step extraction from fiingal cell walls using organic solvents and bubbling or 
freezing of culture filtrates (Wessels, 1994). No reports of successflil production of 
hydrophobins are available; levels of cerato-ulmin were no higher than those obtained with 
other naturally occuring fungal isolates (Temple et al, 1 997). 

Upon shaking hydrophobin-containing solutions, the protein monomers form rodlet-like 
aggregates. These structures are similar to the ones found on surfaces of aerial stuctures. 
The self-assembly of purified Sc3 hydrophobin into a 10 nm thick amphipatic layer on 
hydrophilic and hydrophobic surfaces has been demonstrated (Wosten etal , 1 994a; Wosten 
et al , 1 994b). This film is very strongly attached to the surface and not broken, for instance, 
by hot detergent. The hydrophobic side of the layer on hydrophilic surfaces shows properties 
similar to teflon (Wessels, 1 994). The Sc3 assemblages, as well as those of cerato-ulmin and 
cryparin, also form on gas-liquid or gas-air interphases thus stabilizing mr bubbles or oil 
droplets in water. 

Surface activity of proteins is generally low but hydrophobins belong to surface-active 
molecules, their surfactant capacity being at least similar to traditional biosurfactants such 
as glycolipids, lipopetides/lipoproteins, phospholipids, neutral lipids and fatty acids (Wosten 
and Wessels, 1 997). In fact Sc3 hydrophobin is the most potent biosurfactant known. It 
lowers the water surface tension to 24 mJm^ at a concentration of 50 ,ug/ml due to a 
conformational change during self-assembly of monomers into an amphipathic film (Wosten 
and Wessels, 1997). 
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Hydrophobin-like molecules vaiy in their properties. For instance, rodlet-forming capacity 
has not been assigned for all hydrophobins (snch as some class II), or they might have a 
weaker tendency to form stable aggregates (Russo et al, 1992; Carpenter et aL, 1992). 
Another group of fungal amphiphatic proteins are repellents (W osten et aL, 1 996 (Ustilago), 

5 for review, see Kershaw and Talbot, 1998). Consequently, other type of proteins suited as 
targeting proteins for ATPS, may have only some of the features assigned to hydrophobins. 
Other suitable protdns are hydrophobic ones such as e.g. lipases, cholesterol oxidase, 
membrane proteins, small peptide drugs Hke nisin, aggregating cell wall proteins, lipopetides 
or any parts of these or combinations of these, and other molecules like glycolipids, 

10 phospholipids, neutral lipids, fatty acids in combination with proteins or peptides. 



In this invention the targeting protein, such as a hydrophobin-like protein or parts of it, is 
bound to the product molecule or the component to be separated. First, phase forming 
materials and eventually possibly also additional salts are added to a watery solution 

15 containing the fusion molecule or component, and optionally also the contaminating 
materials. The added agents are mixed to facilitate their solubilization. As soon as they are 
solubilized the two phases are formed either by gravity settling or centrifugation. In the 
separation the target protein drives the product to for instance the detergent-rich phase 
which could either be the top or the bottom phase. The method is not only useful for 

20 purification of products of interest but also for keeping the product or the component of 
interest, such as a biocatalyst, in a particular phase which enables certain useful biotechnical 
reactions. 



Several ATPS systems are suitable for performing this invention. These include PEG 
25 containing systems, detergent based systems and novel thermoseparating polymers. 
Detergent based systems can be nonionic, zwitterionic, anionic or kationic. The system can 
be based on amphiphilic polymeric detergents, micelle forming polymers. Novel polymers 
can be based on polyethylene-polypropylene copolymers such as pluronic block copolymers, 
Brij, polyoxyethylene derivatives of partial ethers of fatty acids made by adding polyoxyet- 
30 hylene chains to the nonesterlfied body and polyoxyethylene derivatives. The well known 
PEG/salt, PEG/dextran and PEG/starch (or derivatives such as Reppal, hydroxipropyl 
starch) systems where PEG and water are forming the top phase and dextran/starch/salt and 
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water are forming the bottom phase. As salts are used phosphate, citrate, sulfate or others. 
In the present process the target is partitioning mainly to the top phase, while most of the 
contaminants are separating mainly to the bottom phase. Some hydrophobic contaminants 
might partition to the top phase as well. Using detergent based systems only one phase 
forming detergent has to be added. Optionally, salts and other chemicals can be used in 

addition. The mentioned chemicals are added, and the solution is mixed. After mixing the 
separation can take place either by centrifugation or gravity settling. In order to separate 
into two phases the temperature of the solution has to be over the cloud-point of the 
detergent. The solution has to be heated if the cloud-point is not reached otherwise. If 
wanted, a second separation step can follow after a first e?ctraction step and the product rich 
phase can be further purified. Also the remaining product in the product poor and by- 
product rich phase can be re-extracted. Very good K values can be obtained and the yields 
and concentration factors are high. 

The process of the present invention can advantageously be used in laboratory scale but is 
especially suitable for large scale separations. It can successfully be used in the separation 
of proteins and components fi*om large fermentations. Using genetic modifications, the 

method can be used to purify any protein of interest including extracellular enzymes and 
proteins such as cellulases and hemicellulases from mixtures containing large amounts of 
protein such as several grams per liter. Furthermore, this separation can be obtained from 
various culture media including industrial media containing particular materials such as 
cellulose and spent grain. The method can be used to purify the product from culture media 
of strains modified not to produce endogenous hydrophobins. The separation can be done 
directly from the fermentation broth which can additionally contain cells, even viscous 
filamentous fungi. High biomass levels are acceptable for the process as explained in 
example 9. An example is the extracellular endoglucanase I from the fimgus Trichoderma 
reesei which can be tagged for instance with the class 2 HFBI and can for example be 
separated with the nonionic polyoxyethylene C12-C18E05. In this example the detergent 
rich phase is the lighter phase and contains most of the tagged endoglucanase while most 
of the other cellulases, proteases and other enzymes remain in the heavier phase. The 
mycelium separates to the bottom phase, too. The separation can be achieved using 
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separation temperatures higher than 25 **C. The temperature can be decreased if certain salts 
like NaCi or K2SO4 are added. 

The invention describes separation of molecules produced in various different organisms 
such as bacteria, yeast and filamentous fungi. The invention is suitable for purification of 
product molecules from extra- or intracellular locations, including cell wall bound molecules. 
It provides examples how the fusion molecule can be secreted by these different organisms 
but also provide an example how the fusion can be produced intracellularly. 

The invention further describes how fusion molecules consisting of several domains can be 
constructed and successfully expressed and produced. The invention describes fusions of the 
targeting molecule to a small protein (CBD), to a moderately sized protein (EGI) and to a 
huge highly glycosylated protein (Flol), and different domain variations of these. These 
molecules can be ready as such for biotechnical use. Alternatively, the product can be 
cleaved from the targeting protein by any method known in the art such as with proteases 
e.g. thrombin, factor X, papain or by chemical cleavage. Furthermore, ATPS is a preferential 
means to be used to separate the product from the targeting protein after cleavage, or these 
can be separated with other methods known in the art. 

A suprising feature is that the targeting protein can also be used to carry large particles to 
the desired phase in ATPS. This can be obtained if the particles already contain proteins 
suited for targeting such as spores/conidia do in case of fungi. The targeting protein can also 
be attached to the particles or compounds in vitro. If cells are separated, the targeting 
protein can alternatively be expressed in the recombinant cells in such a way that it is 
exposed at the cell surface whereby it mediates the separation of the ceils in ATPS. A 
teaching how this can be done is provided in example 22. Other types of molecules which 
direct the targeting molecule to the cell surface can be found e.g. in the literature including 
bacterial outer membrane proteins and lipoproteins (Stahl and Uhlen, 1 997), and yeast 
proteins a-agglutinin and floccuHn(Schreuder 5/ 1996; Kliseifa/. (1994) WO 94/01567, 
Frenken (1994) WO 94/18330). 
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A further advantage of the system is that the invention combined with ATPS provides a 

means to separate the product or desired component not only from other unnecessary or 

unwanted proteins but also from harmful proteins such as proteases as described in example 
6. Thus, the invention is particularly suited for production and purification of heterologous 
proteins, e.g. sensitive mammalian proteins usually produced in limited amounts in 
heterologous hosts. Such proteins are for instance antibodies or fragments thereof, 
interferon, interleukin, oxidative enzymes and any foreign protein which can otherwise be 
produced in a host. It is possible that separation of the product from e.g. culture medium 
can also be obtained on-line or semi-continuously, thus minimising the effect of proteases 
or other harmful components present in the culture. When produced intracellulariy, the 
invention also provides means to separate the heterologous product, for instance the 

inclusion bodies it may form, from the cellular extracts. 

This invention describes for the first time that fusion proteins containing hydrophobin-like 
molecules can be made and produced in significant amounts despite the very particular 
properties of hydrophobin-like molecules. Importantly, this invention also describes how 
recombinant strains producing increased amounts of hydrophobin-like proteins as such can 
be made. This provides means to produce the targeting protein for uses in which it is wanted 
that the targeting protein is bound to the product or particle in vitro, to enable further 
separation of such molecules or particles in ATPS. 

Importantly, this invention also decsribes how hydrophobin-like molecules can be purified 
in ATPS very efficiently with high K-vaiues. The molecules can be separated in the same 
way as the above mentioned fusions, for instance by PEG systems of by detergent -based 
systems. Separation can be done from the culture medium or from cells. This provides a 
significant improvement in making pure preparates containing hydrophobin-like molecules 
since due to their properties their purification is very complicated and results in losses with 
the previously reported techniques as described above. 

Brief description of drawings 



Fig. I 



shows the map of the plasmid pMQ103 
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shows the map of the plasmid pMQ113. 
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Fig. 3 shows the map of the plasmid pMQ104. 
Fig. 4 shows the map of the plasmid pMQl 14. 
Fig. 5 shows the map of the plasmid pMQ 1 05 , 
Fig* 6 shows the map of the plasmid pMQ 115. 
Fig. 7 shows the map of the plasmid pMQ121 . 

Fig. 8 Coomassie-stained 10% SDS-PAGE of the partitioning of EGIcore-HFBI 

fusion protein in two-phase separation using 5 % of the detergent C12- 
CI8E05. Lane 1, Molecular weight marker; Lane 2, Purified CBHI (4 |ig); 
Lane 3, Purified EGI (4 ^g); Lane 4, l/IO diluted VTT-D-98691 cellulose- 
based culture filtrate; Lanes 5 and 6, 1/10 diluted bottom phase and detergent 
phase (top phase), respectively, after separation of VTT-D-9869 1 culture 
filtrate with 5% detergent; Lane 7, Non-diluted bottom phase; Lane 8, Non- 
diluted VTT-D-9869 1 cellulose culture filtrate. 

Fig, 9 Western analysis of the partitioning of EGIcore-HFBI fijsion protein in two- 
phase separation by using different concentrations of the detergent C12- 
C18E05. Fusion proteins were detected with anti-HFBI antibodies. Lane 1, 
Molecular weight marker; Lane 2, Purified EGI; Lane 3, VTT-D-9869 i 
cellulose culture filtrate; Lanes 4 and 5, Detergent phase (top phase) and 
bottom phase, respectively, after separation of VTT-D-98691 culture filtrate 
with 5% detergent; Lane 6, Same as lane 3, except 2 % detergent was used, 
Lane 7, Same as lane 4, except 2 % detergent was used; Lane 8, Purified EGI; 

Lane 9:, Purified CBHI. 
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Fig. 10 Coomassie-stained 1 0% SDS-PAGE showing further purification of EGIcore- 

HFBI fusion protein from the endogenous CBHI when the top phase was re- 
extracted with 2% detergent. Lane 1, Molecular weight marker; Lane 2, 
Purified CBm (4 ^g); Lane 3, Purified EGI (4 }xg); Lane 4, Detergent phase 
(top phase) after first extraction; Lane 5, Detergent phase (top phase) after 
second extraction. 



Fig. II Coomassie-stained 10% SDS-PAGE analysis of the EGI-HFBI protein when 

treated with thrombin. Lane 1, Molecular weight marker; Lane 2, EGI-HFBI 
(1 mg/mi) treated 72 h with 3 U of thrombin at 24X; Lane 3, Same as lane 2, 
except no thrombin was added; Lane 4, EGI-HFBI (1 mg/ml) treated 48 h with 
9 U of thrombin at 36°C; Lane 5, Same as lane 4, except no thrombin was 
added; Lane 6, Same as lane 5, except no incubation at 36**C. 



Fig, 12 shows the map of the plasmid pTNS 1 3 



Fig. 13 Western analysis of the partitioning of dCBD-HFBI fusion protein in two-phase 

separation using 5 % of the detergent C12-C18E05 Fusion protein was 
detected with anti-HFBI antibody. Lane 1, Four times concentrated culture 
filtrate; Lane 2, Four times concentrated bottom phase; Lane 3, Top phase. 

Fig. 14 shows the map of the plasmid pTNSlS. A non-functional restriction site is 

indicated with an asterisk. 



Fig. 15 shows the map of the plasmid pTNSlS. 



Fig. 16 shows the map of the plasmid pTNS23 . 



Fig. 17 



shows that separation to the detergent phase of cells expressing a hydrophobin 
on cell surface occurs: the top detergent phase ofS. cerevisiae VTT-C-993 1 5 
is turbid, whereas the top detergent phase of the control strain H2155 is clear. 
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Fig. IS shows the map of the piasmid pTNS32. 

Fig. 19 shows the map of the piasmid pTH4. 

Fig. 20 shows the map of the piasmid pTNS30. 

Fig. 21 shows the map of the piasmid pTH 1 . 

Fig, 22 shows the map of the piasmid pTH2. 

Fig. 23 shows the map of the piasmid pKS2. 

Fig. 24 SDS gel showing the purification of HFBl-dCBD in a Berol 532 ATPS system. 

Fig. 25 Western analysis of the partitioning of HFBI-ENA5SoFv fusion protein in two- 
phase separation using 4% of the detergent C12-C18E05. Lanes from left. 
(1) Molecular weight marker, (2) Top (enriched) phase of the strain VTT-D- 
00791, (3) Bottom (depleted) phase of the strain VTT-D-00791. 

Fig. 26 Western analysis of the partitioning of SC3 hydrophobin using SC3 specific 

antibodies. Lane 1, Bottom phase of the strain VTT-D-00792, Lane 2, Top 
phase of the strain VTT-D-00792. 

Fig, 27 HPLC analysis for monitoring of the partitioning of HFBI in 2% Berol 532. 
Fig. 28 HPLC analysis for monitoring of the partitioning of HFBII in 2% Berol 532. 



The invention is further illustrated by the following Examples which describe construction 
of the fusion molecules of the invention, and partitioning of the molecules of interest using 
the process according to the invention. 
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EXAMPLES 
Example 1 

Construction of vectors for expression of EGI and EGIcore HFBI fusion proteins 
under the cbhl and gpdl promoters of Trichoderma and gpdA promoter of Aspergillus 

For construction of an EGI-HFBI fusion protein, hfbl (SEQ ID 1) coding region (from Ser- 

23 to the STOP codon) and a peptide linker (Val Pro Arg Gly Ser Ser Ser Gly Tlir Ala Pro 
Gly Gly) preceding it was amplified with PGR using pTNS9 as a template and as a 5' primer 
TCG GG C ACT ACG TG C CAG TAT AGC AAC GAC TAC TAC TCG CAA TGC 
CTTGTTCCG CGTGGC TCTAGT TCT GGA ACC GCA (SEQ E) 2) and as a 3^ primer 
TCG TAC GGATCC TCA AGC ACC GAC GGC GGT (SEQ ID 3). pTNS9 has been 
decribed in detail in Example 19. The sequence in bold in the 5' primer encodes 16 C-terminal 
residues of EGI. The sequence In italics is a thrombin cleavage site and the underlined 
CACTACGTG is a DraHI site. The underlined GGATCC in the 3' primer is a BamHI site. 
The 280 bp PGR fragment was purified from agarose gel and ligated to pGEM-T T/A vector 
(Promega) resulting in pMQ102. 

For construction of an EGIcore-HFBI fusion protein, the hfbl coding region (as above) was 
amplified with PGR using pTNS9 as a template and as a 5' primer ACT ACA CGG AG G 
AGCTC G ACG ACT TCG AGC AGC CCG AGC TGC ACG CAG AGC AAC GGC 
AAC GGC (SEQ ID 4) and as a 3' primer SEQ ID 3. The sequence in bold in the 5' primer 
encodes amino adds 410-425 in EGI and the underlined GAGCTC is a Sad site. The 260 bp 
PCR fragment was purified from agarose gel and ligated to pPCRU T/A vector (Invitrogen) 
resulting in pMQl 1 1 . 

In the next step Trichoderma expression vectors for production of EGI-HFBI and EGIcore- 
HFBI fiasion proteins under the control of cbhJ promoter and terminator sequences were 
constructed. The expression vector used as a backbone in the constructs is pPLE3 (Nakari 
al (1994) WO 94/04673) which contains a pUCI 8 backbone, and carries the cbhl promoter 
(SEQ ID 5) inserted at the EcoRI site. The cbhl promoter is operably linked to the Ml length 
egll cDNA (SEQ ID 6) coding sequence and to the cbhl transcriptional terminator (SEQ ID 
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7). The plasmid pMQ102 was digested with Drain and BamHI and the released 280 bp 
fragment containing hfbl and linker sequences was purified from agarose gel and ligated to 
pPLE3 digested with Dralll and BamHI. The plasmid pMQl 11 was digested with Sad and 
BamHI and the 260 bp fi*agment containing the hjbl sequence was ligated to pPLE3 digested 
5 with Sad and BamHI. The resulting plasmids pMQ103 (Figure 1) and pMQl 13 (Figure 2) 
carry the coding sequences for fiiU-length EGI linked to HFBI via a peptide Imker and for 
EGIcore linked to HFBI via its own linker region, respectively, under the control of cbh J 
promoter and terminator sequences. 

0 Trichoderma expression vectors for production of EGI-HFBI and EGIcore-HFBI fusion 
proteins under the control of gpdl promoter and terminator sequences of Trichoderma and 
gpdA promoter and trpC terminator sequences of Aspergillus were constructed as follows. A 
SacII site was inserted inbetween the Xbal and Pad sites of pMV4 using as an adapter 

annealed primers TAA CCG CGG T (SEQ ID 8) and CTA GAG CGC GGT TAA T (SEQ 
5 ID 9). The resulting plasmid is pMVQ. pMV4 contains a pNEB193 (New England Biolabs) 
backbone, and carries a 1.2 kb Trichoderma gpdl promoter (SEQ ID 10) and a 1.1 kb gpdl 
terminator (SEQ ED 11) inserted at Sall-Xbal and BamHI- AscI sites, respectively. The 
expression cassettes for EGI-HFBI and EGIcore-HFBI were released from pMQ103 and 
pMQl 13 with Sacn and BamHI, purified from agarose gel and ligated to pMVQ cut with 
0 Sacn and BamHI. The resulting plasmids pMQ104 (Figure 3) and pMQI 14 (Figure 4) carry 
the EGI-HFBI and EGIcore-HFBI cassettes, respectively under the control of Trichoderma 
gpdl transcriptional control sequences. Expression plasmids pMQlOS (Figure 5) and 
pMQllS (Figure 6) containing EGI-HFBI and EGIcore-HFBI cassettes, respectively, 
operably linked to the gpdA promoter and trpC terminator of Aspergillus were constructed. 
5 EGI-HFBI and EGIcore-HFBI cassettes were released from plasmids pMQ 1 04 and pMQ 1 1 4 
with Xbal and BamHI, blunted with T4 DNA polymerase and ligated to Ncol digesteded and 
T4 DNA polymerase treated pAN52-l (SEQ ID 12). pAN52-l contains a pUC18 backbone, 
and carries a 2.3 kb gpdA promoter and a 0.7 kb trpC terminator sequences of A. nidulans. 
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Example 2 

Construction of vectors for over-production of HFBI on cellulase-inducing and - 
repressing media 

For over-expression of HFBI under cbhl promoter the protein coding region of hjbl was 
amplified with PGR using as a template pEAlO (Nakari-Setala et aL, 1996). pEAlO carries a 
5.8 kb genomic Sail fi-agment containing hfbl coding and flanking sequences. GTC AA C 
CGCGG A CTG CGC ATC ATG AAG TTC TTC GCC ATC (SEQ ID 13) was used as 
a 5* primer in the PGR and as a 3' primer SEQ ID 3. The sequence in bold in the 5' primer is 
21 bp of cbhl promoter adjacent to translational start site of the corresponding gene and the 
underlined CC(X^GG is a Kspl site. The obtained fragment of 430 bp was digested with Kspl 
and BamHI and ligated to pMQ103 digested with Kspl and BamHI. The resulting plasmid 
pMQ121 (Figure 7) carries the coding sequence of hfbl operably linked to cbhl 
transcriptional control sequences. pEAlO plasmid is used for over-production of HFBI in 
cellulase-repressing conditions. 

Example 3 

Transformation of Trichoderma and purification of the EGI-HFBI and EGIcore-HFBI 
producing and BDFBI over-producing clones 

Trichoderma reesei strains QM9414 (VTT-D-74075) and Rut-G30 (VTT-D-86271) were 

co-transfomied essentially as described (Penttila et al, 1987) using 3-13 |xg of the plasmids 

pMQ103, pMQ113, pMQ104, pMQ114, pMQ105, pMQ115, pMQ121 and pEAlO and as 

the selection plasmids 1-3 jig pToG202, p3SR2 or pAR021. pToG202 (pUC19 backbone) 

and p3SR2 (pBR322 backbone) plasmids carry 2.7 kb Xbal and 5 kb EcoRI-SaH genomic 

fragments of^. nidukms, respectively, containing the amdS g&n& (Hynes et aL, 1983; Tilbum 

et aL, 1983). pAR021 is essentially the same as pRLMex30 (Mach et al 1994) and carries 

the E. coll hph gene operably linked to 730 bp ofpkil promoter and 1 kb of cbh2 terminator 

sequences of T, reesei. The Amd+ and Hyg+ transformants obtained were streaked three 

times onto plates containing acetamide and hygromycin, respectively (Penttila et al^ 1987) 

Thereafter spore suspensions were made from traasfomants grown on Potato Dextrose agar 
(Difco). 
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1 tie production ot the fusion proteins EGI-HFBI and EGIcore-HFBI and HFBl was tested by 
slot blotting or Western analysis with EGI and HFBI specific antibodies from shake flask or 
microtiter plate cultivations carried out in minimal medium supplemented with either glucose, 
5 lactose or a mixture of Soika flock cellulose and/or spent grain and/or whey. The spore 
suspensions of the fusion protein producing clones were purified to single spore cultures on 
selection plates (containing either acetamide or hygromycin). To determine the best 
producers, production of the fusion proteins was analyzed again from these purified clones as 
described above. 



10 



15 



T, reesei strains selected for fiirther fermentor cultivations are VTT-D-98692 (pEAlO), VTT- 
D-98492 (pMQ121), VTT.D-98693 (pMQ103X VTT-D-98691 (pMQ113), VTT-D-98681 
(pMQlOS) and VTT-D-98682 (pMQllS). These strabs have QM9414 as the host strain. 

VTT-D-99702 rnMOl 13"^ has Rnt-r^O th^^ hr^ct ctrai« 



Example 4 

Cultivation of the EGI-HFBI and EGIcore-BanSI protein producing and HFBI over- 
producing Trichoderma strains 



2 0 EGI-HFBI and EGIcore-HFBI fusions were produced under the cbhl promoter in a 1 5-litre 
fermenter using T. reesei strains VTT-D-98693 (pMQ103) and VTT-D-98691 (pMQil3), 
respectively. Strains were grown 5 days on minimal medium (Penttila et al^ 1987) containing 
4% Solka flock cellulose (James River Corporation, Berlin, NH) and 2 % spent grain 
(Primalco, Koskenkorva, Finland). EGIcore-HFBI was also produced in fermenter (15 1) 

2 5 using the Rut^C30 strain VTT-D-99702 (pMQ 1 1 3) with 4 % lactose medium. To induce the 
production of EGI-HFBI and EGIcore-HFBI fusions under Aspergillus gpdA promoter, T 
ree^e; strains VTT-D-98681 (pMQ105) and VTT-D-98682 (pMQl 15) were cultivated in 15- 
litre fermenter. Strains were grown 3 to 5 days on minimal medium supplemented with 2% 
glucose, 0.2% Peptone, and 0.1% Yeast Extract, and with glucose feed to maintain the 

30 glucose concentration in the range of 1 to 3% throughout the cultivation. HFBI over- 
producing strain VTT-D-98692 (pEAlO) was grown similarly in 15 1 on glucose medium and 
the stram VTT-D-98492 (pQM121) over-producing HFBI under cbhl promoter was 
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cultivated for 7 days in 15-litre fermentor on medium containing 4% Solka flock and 2% 
spent grain. The control cultivations with the host strains of the transformants, QM9414 
(VTT-D-74075) and Rut-C30 (VTT-D-8627i), were canied out on media containing i) 
Solka flock cellulose and either spent grain or whey, ii) lactose and iii) glucose similarly as 
described above. 

\Vhen proper some T reesei transfbnnant strains and their host strains were also cultivated at 
28°C in shake flasks for 5 to 6 days in 50 to 1 50 ml volume of Trichoderma minimal medium 
(Penttiia etal, 1987) suplemented with either 3% Solka flock cellulose and 1% spent grain or 
3-4% glucose with glucose feeding. 

Example 5 

Standard separation assays and analysis 

If not otherwise stated the standard ATPS and subsequent analyses and calculations were 
carried out as explained in this example. 

In general whole fermentation broth, supernatant (biomass separated by centrifugation or 
filtration) or purified proteins in buffer were separated in 10 ml graduated tubes. First 
detergent was added into the tubes and the tubes were then filled to 10 mg with protein 

containing liquid. The amount of detergent in the tube was calculated in weight percent of 

detergents. After thorough mixing in an overhead shaker the separation took place by either 
gravity settling in a water bath at constant temperature or by centrifiigation at constant 

temperature. The separation usually was performed at 30*^C, the standard amount of 
detergent used was 2-5% (w/v). After separation the volume ratio was noted and samples 
were taken fi-om the lighter and heavier phase for analysis. 

Two-phase separations were analysed qualitatively by using SDS-PAGE gels followed by 
visualization of the fusion proteins with Coomassie brilliant blue R-250 (Sigma) or Western 
blotting. Polyclonal anti-HFBI antibody were used in Western analysis for detection of 
EGIcore-HFBI, EGI-HFBl and dCBD-HFBI proteins together with alkaline phosphatase 
conjugated anti-rabbit IgG (Bio-Rad) Alkaline phosphatase activity was detected 
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colorimetxically with BCIP (5-bromo-4-chloro-3-indolyl-phosphate) used in conjunction with 
NBT (nitro blue tetrazolium) (Promega). 

Contaminating endogenous EGI, CBHI and EGm in the top phase was tested with 
5 appropriate antibodies. Acidic protease activity in the top and bottom phase was also tested 
using the SAP method (Food Chemicals Codex, p. 496-497, 1981), which is based on the 30 

min enzymatic hydrolysis of a hemoglobin substrate. Ail reactions were performed at pH 4 7 
and 40 ^'C. Unhydrolyzed substrate was precipitated with 14% TCA and removed by 
filtration. The released tyrosine and tryptophan was determined spectrophotometrically. Total 
0 protein concentrations were determined by Non-Interfering Protein Assay^ (Geno 
Technology, hic). 

EGI activity was detected using 4-methyiumbelliferyl-B-D-cellobioside (MUC) (Sigma M 

6018) as substrate (Van Tilbeurgh H. & Caeyssens M., 1985; Van Tilbeurgh et.aL, 1982). 
5 EGI hydrolyses the B-glycosidic bond and fiuorogenic 4-methylumbelliferone is released, 
which can be measured using a fluorometer equipped with a 360 excitation filter and a 455 
nm emission filter CBHI also hydrolyses the substrate and it was inhibited by addition of 
celiobiose (C-7252, Sigma). EGI containing liquid was added in an appropriate dilution to a 
buffer containing 50 mM sodium acetate buffer (pH 5), 0.6 mM MUC and 4 6 mM 
celiobiose. The mixture was heated to SO^'C. The reaction was stopped after ten minutes 
using 2% Na2C03, pH 10. Purified CBHI was detected using the same assay as for EGI 
without the addition of the inhibitor celiobiose. 

The partition coeflBcient K was defined as the ratio of the measured concentrations or 
activities in the top and bottom phase, respectively. 
The Yield Y was defined as follows: 



1 + 




r 






' K 



where Yt is the Yield of the top phase, Vb and Vt are the volumes of top and bottom phase, 
respectively. The Yield of the bottom phase can be described accordingly 
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The mass baiances, e.g. recovery of ail added protein, were always checked for completeness 
to ensure no artificially high Yield (e.g. due to possible inactivation of the protein in the 
bottom phase). The vahi^ were usually calculated based on total enzyme activity (EGI wt 
plus the EGI-fusion) and thus the values are underestimated for the separation of the fusion as 
5 demonstrated in Example 1 6 . 



10 EGI-HFBI and EGIcore-HFBI fusions produced under the cbhi promoter in a 15-litre 



fermenter on Solka fiock cellulose and spent grain medium as described in Example 4 using T 
reesei strains VTT-D-98693 (pMQ103) and VTT-D-98691 (pMQllS), respectively, were 



1 5 The phases fi"om the two phase separations were analysed qualitatively by using SDS-PAGE 



gels followed by visualization of the fusion proteins with Coomassie brilliant blue or Western 
blotting. Coomassie stained SDS-PAGE (10%) is shown in Figure 8. In the lane containing 
the non-extracted culture filtrate three distinct closely migrating bands can be seen (the 
sample was diluted I/IO with H2O). The topmost band is CBHI, the band in the middle is 
20 EGIcore-HFBI &sion and the lower one endogenous EGL In the samples separated in 
ATPS, only two bands (CBHI and EGI) are seen in the sample from bottom phase and one 
band representing EGIcore-HFBI in the sample obtained from the top phase. 

Western blotting with HFBI antibody showed thick bands for the top phase, whereas for the 

2 5 bottom phase there was only feint band demonstrating that the fusion is separating strongly 

into the detergent top phase. Figure 9 shows the separation of the EGIcore-HFBI fusion 
produced on cellulose media into the top phase. Contaminating endogenous EGI and EGIIl 
in the top phase was tested with appropriate antibodies but no signal was detected. 

3 0 Small amount of endogenous CBHI was found in the upper phase when CBHI antibody was 

used in Western blotting. EGI, EGUI and proteases were not found in the top phase. Further 
purification from the contaminating CBHI was observed when the top phase was re-extracted 



Example 6 

Small scale ATPS separation studies and gel analysis 



separated in small scale ATPS as described above. 
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with 2 % detergent. The Figure 10 shows that the upper phase does not any more contain 
CBHI and pure fusion protein is recovered. 

EGIcore-HFBI was also produced in fermenter (15 1) using the Rut-C30 strain VTT-D- 
5 99702 (pMQ113) with 4 % lactose medium. The separation in ATPS carried out in the 
standard manner gave essentially the same result as the separation from cellulose containing 
medium thus demonstrating that the purij&cation can be carried out from several media 
relevant for large scale industrial use. 

1 0 Acidic protease activity in the top was only 1/15 compared to the bottom phase (table below) 
demonstrating that acidic proteases remain in the bottom phase. 





A (275 nm) 


HUt'/ml 


Bottom phase^ 


0.146 


41.6 


Top phase 


0.009 


2.6 



M/IO diluted bottom phase after separation of VTT-D-98691 culture filtrate with 2% detergent 
15 ^ 1/1 00 diluted bottom phase after separation of VTT-D-9869 1 culture filtrate with 2% detergent , 

^ 1 HUT = en2yme concentration, which in reaction conditions hydrolyses hemoglobin in 1 min so thai 
the absorbance at 275 nm of the formed hydrolysate equals 1 . 1 0 tyrosine/ml 0.006 N HCi solution. 

These results show that the fiision protein can be purified extremely eflaciently and the 
2 0 resulting preparate is free of other proteins produced by the fungus including proteases. 

Example 7 

Recovery of the native EGI in ATPS after thrombin cleavage 

25 EGI-HFBI protein produced by the strain VTT-D-98693 has a thrombin cleavage site 
(LVPRGS) designed in the linker region between the ,EGI CBD and HFBI, which would 
enable the recovery of the native EGI after thrombin cleavage. EGI-HFBI fiision protein was 
purified fi-om the culture filtrate (100 ml) of strain VTT-D-98693 grown on 4% Solka flock 
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cellulose and 2 % spent grain as described in Example 4 using the 2-phase separation system 
(5 % detergent). After removal of the bottom phase the detergent phase was extracted by 
isobutanoL The resulting water phase (--19 ml) was divided in eppendorf tubes and the liquid 
was evaporated with speed vac. Remaining lyophilizate was diluted to 50 mM Tris-Cl (pH 8). 
5 To test the efficiency of thrombin cleavage, 9 units of thrombin (Sigma) was incubated > 24 h 
with 1 mg EGI-HFBI fusion protein in 36 ""C at pH 8.0. Coomassie stained SDS-PAGE (10 
%) was used for detection. 

Only minor cleavage was observed in 48 h under these conditions (Figure 1 1), possibly due to 
0 steric hindrance by 0-glycosylation in the linker. 

Example 8 

Separation of low concentrations of EGIcore-HFBI in ATPS 

5 Detergent based aqueous two-phase systems were successfully applied using very low 
concentrations (diluted) of EGIcore-HFBI fusion protein produced with the cbhJ promoter in 
T reesei VTT.D-98691 (pMQ 113) from a 15 liter cultivation carried out on Solka flock 
cellulose vrith spent grain as described in Example 4. 

0 The original protein concentration of the supernatant was 7.0 mg/ml. This supernatant was 
diluted with de-ionised water by a factor of 100 and 1000, respectively. The fusion protein 
could be separated using 2% (w/w) of the detergent C12-C18E05 with partitioning 
coefiSlcients higher than 5. This is shown in the table below together with the experiment with 
non-diluted supernatant. The partitioning coefficients were calculated based on activity 

5 measurements for total EGI (wild type and fusion protein together). 





EGIcore-HJbBI 
non-diluted supernatant 


dilution 1/100 


dUution 1/1000 


K 


4.1 


5.3 


5.6 


Y [%] 


38 


31 


32 
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Example 9 

Separation of EGlcore-HFBI from fungal biomass containing culture broths 

EGIcore-HFBI from Treesei strain VTT-D-98691 (pMQ 113-2) was cultivated (50 ml in 
5 250 ml shake flasks) on Solka flock celiuiose with spent grain as described in Example 4. 
Directly after the cultivation, part of the whole broth was centrifiaged at 3000 rpm for 30 min, 
supernatant was spilled out and the centrifiiged mycelium was added to the supematant to 
obtain artificial whole broths containing different amounts of biomass. 

10 Using 5% of C12-C18E05 in a 10 g experiment consisting of up to 50% wet biomass 
(weight of wet biomass divided by the sum of wet biomass and supematant) could stili be 
separated without any difficulties. The Yield remained in between 61 and 64 % and therefore 
it is not significantly different in comparison to the experiment carried out with supematant 
only (without mycelium) (see table below). The total recovery of the fusion protein is even 

1 5 higher. This is most probably due to cell attached enzyme extracted in the ATPS increasing 
the total amount of EGI. The partitioning coefiftcient was calculated based on activity 
measurements for total EGI (wild type and fusion protein together). 



K 



Y[%] 



Supematant 



5.5 



62 



25% of wet biomass in 
supematant 



7.3 



66 



40% of wet biomass in 
supematant 



6.4 



61 



50% of wet biomass in 
supematant 



7,6 



64 
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Example 10 

Separation of EGI-HFBI in ATPS 



EGI-HFBI from Trichoderma reesei strain VTT-D-98693 (pMQ 103) from a 15 liter 
5 cultivation carried out on Solka flock cellulose and spent grain as described in Example 4 was 
separated in a 10 g experiment using diflFerent amounts of C12-C18E05. The partitioning 
coefficients are shown below. The partitioniag coefficient was calculated based on activity 
measurements for total EGl (wild type and fusion protein together), and as in previous 
examples the endogenous EGI is included in the partitioning coefficients. 



Detergent [% w/w] 


2 


3 


5 


7 


K 


1.9 


1.8 


1.4 


l.l 



Example 1 1 

Separation of EGIcore-BDFBI in 50 ml 

5 

EGIcore-HFBI from T, reesei strain VTT-D-98691 (pMQ 113) cultivated in 15 liters using 
Solka flock cellulose and spent grain as described in Example 4 was separated in Falcon tubes 
in a 50 g experiment using 5% of C12-CI8E05. A partition coefficient of 2.52 and a yield of 

51 % could be obtained. The separation was performed at SC'C at 3000rpm for 30 minutes 
0 The values are based on activity measurements for total EGI activity (wild type and fusion 
protein together) including endogenous EGL 

Example 12 

Separation of EGIcore-HFBl in ATPS using different detergents 

5 

EGlcore-HFBI from T. reesei strain VTT-D-9869 1 (pMQ 1 1 3) cultivated in 1 5 liters using 
Solka flock cellulose with spent grain as described in Example 4 was separated in a 10 g 
experiment using 2% of detergent in each experiment. The detergents investigated in this 
example were CIO E05, C12 E05, C14 E06 (each Nikko Chemicals, Japan), C12-C1 8 E05 
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CAgrimul NRE 1205", Henkel, Germany), C12/14 5E0, C12/14 6E0 ( Clariant, Germany), 
C9/11 E05.5 (,3errol 266", Akzo Nobel, Germany), Triton X-114 (Sigma, Germany). The 
partition coefificients and yields are listed below. The values are based on activity 

measurements for total EGI activity (wild type and fiision protein together) including 
5 endogenous EGI. 
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1 0 Example 13 

Separation of EGIcore-HFBI in ATPS from glucose grown cultures 

EGIcore-HFBl was separated from cultivation of the Trichoderma reesei strain VTT-D- 
98682 (pMQ115) cultivated with glucose as described in Example 4, The supernatant was 
15 separated with 2% of the detergent C12-C18 E05. The fusion protein could be partitioned 
with a K value of 2.4. In comparison, the K value for the native EGI is 0.3 when measured in 
a similar way for purified EGI. 

Example 14 

2 0 Separation of EGIcore-HFBI using different concentrations of detergent 

EGIcore-HFBl from T reesei VTT-D-98691 (pMQ 113) cultivated in 15 litres using Solka 
flock with spent grmn as described in Example 4 was separated in detergent based ATPS 
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applying different amounts of the detergent C12-C18 EOS on the cell free supernatant. The 
partitiojiing coefficients are shown in the table below. The corresponding gel electrophoresis 
and Western antibody-blots are shown in Figure 8 and Figure 9, respectively. 
The values are based on activity measurements of total EGI activity. 



Amount of detergent C12-C18 E05 


K 


Yield(%) 


L0% 


6.1 


9 


2.0% 


4.1 


38 


3-5% 


3.6 


50 


5.0% 


2.9 


55 


7,5% 


1.7 


53 


10.0% 


1.1 


58 



Example 15 

Re-extraction of the detergent phase 

10 

Detergent based ATPS was applied on EGIcore-HFBI fusion protein containing supernatant 
produced by the strain VTT-D-98691 (pMQ 113) in a shake fiask cultivation. The first 
extraction using C12-C18E05 conducted under the standard conditions shows a partitioning 
coefficient of 16 and a yield of 72 % ( vAM type EGI measured together with fusion protein). 

15 The top phase was re-extracted in 10 mM sodium acetate buffer (pH 5) with 2% of 
detergent. A partitioning coefficient of 52 and a yield of 89 % could be obtained. In the re- 
extraction experiment of the bottom phase (2% of detergent), a small yield of 7.5% and a K 
of 0.8 of EGI activity were achieved. The partitioning coefficients were calculated based on 
activity measurements for total EGI (wild type and fusion protein together). Due to the wild 

2 0 type EGI present in the sample, the yield is at least 72% and the partitioning coefficient at 
least 16 in the first e?ctraction. The SDS-PAGE results of both extraxtions are shown in 
Figure 10. 
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Separation step 


K 


Y[%] 


2% detergent 


16 


72 


reextraction top phase 


52 


89 


reextraction bottom phase 


0.8 


7.5 



Example 16 

Separation of pure cellulases in ATPS 

5 

The effect of HFBI on partitioning and the final yield can fiirther be demonstrated by 
comparing the extraction result of EGIcore-HFB fusion with extraction results obtained with 
purified wild type EGI and EGIcore. The fiision protein is partitioning more than 100 times 
better to the detergent phase (see table belove). 

10 

The improvement on the partitioning of the purified fusion protein from the first extraction 
obtained in the re-extraction (see Example 1 5) can be explaned by the partitioning of the wild 
type EGI as demonstrated with purified wild type EGI in the table below. The wild type EGI 
lowers the partitioning coefiacient in the first extraction (since EGI activity is measured fi-om 
1 5 both top and botom phase), but the absence of it in the re-extraction increases the partitioning 
coefficient of the EGIcore-HFBI fusion. The purity can in addition be demonstrated by 
analysing the partitioning of pure CBHL, which is the major contaminating protein 
corresponding to about 50 % of all secreted T. reesei proteins. Pure CBHI has a partitioning 
coefficient of 0.5 and a yield of 3.6 and is therefore separated fi-om the fusion protein. 

20 



Separation step 


K 


Y[%] 


re-extraction of top phase 


52 


89 


extraction of pure wild type EGI 


0.3 


2.2 


extraction of pure EGI-core 


0.3 


2.3 


extraction of pure CBHI 


0.5 


3.6 
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Using the definitions of K and Y and calculating mass balances, the ratio of the amount of 

EGI fusion protein to EGI wild type can be calculated. The "true" partition coefficients and 
Yields can be concluded from this. "True" means the values which would be detected if it 
would be possible to measure only the amount of EGI-fiision without measuring the amount 
for EGI wild type at the same tima 



10 



The fundament for the calculation is the re-extraction experiment The re-extracted top phase 
is believed to be pure. An example of the measured values and the calculated "true" values 
based on this are shown in the table below for two cultivations of VTT-D-98691 (pMQl 13) 
grown as described in Example 4. 



cultivation 
vessel 



15 liter 
fermenter 

250 ml shake 
flask 



cultivation 
substrate 



whey permeate 



cellulose 



K "with EGI 
wt" 



4 



15 



"true" 
K 



54 



Y [%] "with 
EGI 

wt" 



16 



66 



"true" 
Y [%] 



54 



90 



15 Example 17 



FBI and HFBn purification in ATPS 



20 



HFBI was produced by cultivating the T reesei strain VTT-D-98692 (pEAlO^lOSB) using 
glucose as substrate as described in Example 4. HFBI could be separated using 2% of the 
detergent C12-C18 E05 with a partition coefficient higher than 20 under the standard 
conditions described. 



HFBII was produced by cultivating the T reesei strain VTT-D-74075 (QM9414) on whey 
spent grain as described in Example 4. HFBH could be separated using 2% of the detergent 
2 5 CI 2-C 1 8 E05, exceeding a partition coefficient of 1 0 under the standard conditions. 



Both HFBI and HFBII hydrophobins are thus partitioning well to the upper phase in ATPS. 
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Example 18 

Detergent based ATPS with additional NaCl 

EGIcore-HFBI from which cultivation of T.reesei was separated in a 10 g experiment using 
5% of C12-C18E05. The partitioning coejH&cient of the supernatant was 3.5 with a volume 
ratio of 0.2. Using 1.1 % (wA^) NaCl the partitioning coefficient could be increased to 4.3 
with a lower volume ratio of 0. 14. 

Example 19 

Construction of an E, coli strain expressing a fusion protein BDFBI-dCBD, containing 
hydrophobin I and double cellulose binding (CBD) domains 

A 280 bp DNA fragment containing a modified cbh2 linker region followed by the coding 
region of hfbl from Ser-23 to the STOP codon was amplified by PGR using the piasmid 
pAROl (Nakari-Setala et al, 1996) as a template. The 5' primer was 5' TCT AGC AAG 
CTT GGC TCT ACT TCT GGA ACC GCA CCA GGC GGC AGC AAC GGC AAG 

GGC AAT GTT TGC (SEQ ID 14) and the 3' primer was 5' TCG TAG AAG GTT TCA 
AGC ACC GAC GGC GGT (SEQ ID 15). The sequences in bold in the 5' and 3* primers 
encode tiie modified CBHE linker (Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly) and a 
translational STOP, respectively, and the underiined AAGCTT in both primers is a HindHI 
site. The PGR fragment was purified from agarose gel, digested with HindUI and ligated to 
Hindm digested and SAP treated (Shrimp Alkaline Phosphatase, USB) pSP73 resulting in 
piasmid pTNS9. 

For subsequent cloning of the modified GBHE linker-HFBI fragment to an E coli expression 
vector, pTNS9 was digested with Hindin and the proper fragment was purified from agarose 
gel. This Hindm fragment was cloned to Hindlll digested and SAP treated (Shrimp Alkaline 
Phosphatase, USB) B599 resulting in pTNS13 (Figure 12). The E, coli expression vector 

B599 is essentially the same as the one described by Linder et ai (1996) except that it is 
missing a STOP codon at tiie end of the protein coding sequence. It carries the coding 
sequence for a fusion protein containing GBHII CBD (41 N-tenminal residues of CBHIl) and 
CBHI CBD linked together via CBHI linker region (CBHI linker and CBD are the last 57 
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residues in CBHI). The expression and secretion of the fusion protein in B599 is under the 
cotrol of tac promoter and pelB signal sequence (Takkinen eial, {\99\), pTNS 1 3 expression 
vector thus carries the coding region for a fiision protein of double CBD and HFBI linked in 
frame via the Gly-Ser-Ser-Ser-Gly-Thr-Ala-Pro-Gly-Gly peptide. This vector also contains 
5 the canp gene for selection ofE. coli transformants. pTNSlS plasmid was transformed into £. 
coll strain RV308 (su-, AlacX74, gallSE::OP30S, strA) and this strain was used for 
production of the fusion protein. 

Example 20 

1 0 Separation of HFBI-dCDB molecules expressed in Rcoli in ATPS 

dCBD-HFBI was produced in E. coli strain RV 308 transformed with pTNS13 plasmid as 
described above. The inoculum of RV308/pTNS13 was grown to the exponential growth 
phase in LB medium containing ampicillin (0. 1 g/1) and 1% glucose. Fermentation was carried 

1 5 out using mineral salt medium described by Pack et al (1993) with glucose (feed) in 10 litre 
fermenter. During cialtivation temperature was maintained at 28 "^C and pH was controlled at 
6.8 with NH4OH. Ceil growth was monitored by measuring ODeoo and dry weight of 
biomass. The culture was induced with 50 jliM (final concentration) IPTG (isopropyl-p-D- 
thiogalactopyranoside) at late-exponential growth phase (OD6oo=50-60) to promote fusion 

2 0 protein production. 

Two-phase separation analysis of dCBD-HFBI protein was performed using cuhure filtrate 
and 5% detergent in the total volume of 40 ml. Results from Western blotting showed that 2- 
phase separation with 5 % detergent in the standard way was highly specific also for the 
25 dCBD-HFBI fiision. Strong signal was observed in the sample fi-om the detergent phase 
compared to the sample from the bottom phase as shown in Figure 13 . 
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Example 21 

Construction of yeast strains expressing HFBI-FIoI fusion protein on the cell surface 



For construction of a HFBI-FLOl fUsion protein expression cassette, hfbl (SEQ ID 1) 
5 coding region (from Ser-23 to the STOP codon) was amplified with PGR using pAROl 
(Nakari-Setala et aL^ 1996) as a template and as a 5' primer TCT AGC TCT AGA AGC 

AAC GGC AAC GGC AAT GTT (SEQ ID 16) and as a 3' primer TGC TAG TCG ACC 
TGC TAG CAG CAC CGA CGG CGG TCT G (SEQ ID 17). The underlined sequences in 
the 5' and 3' primers are Xbal and Nhel sites, respectively. The 0.225 bp PGR firagment was 

10 purified fi-om agarose gel and ligated to pGEM-T vector (Promega) resulting in pTNSlO. 
The hfbl firagment was released fi-om pTNSIO with Xbal and Nhel and ligated to pTNSlS 
cut with the same restriction enzymes. Plasmid pTNSlS (Figure 14) is essentially the same as 
plasmid pBR-ADHl-FLOlL by Watari et al 1994 except that a Nhel site in the pBR322 
backbone has been replaced by a BglH site and a unique Xbal site is introduced by linker 

1 5 cloning in the unique AocI site preceding the putative signal sequence cleavage site. The 
resulting plasmid pTNSlS (Figure 15) contains the complete expression cassette for HFBI- 
FLOl fiision protein in which HFBI substitutes the putative lectin domain from Ser-26 to 

Ser-3 19 in the yeast flocculin FLO I (SEQ ID 18). 



20 In the next step, yeast expression vector for production of HFBI-FLOl fiasion protein was 
constructed. The expression vector used as a backbone in the construct is pYES2 
(Invitrogen) (SEQ ID 19) which is a high-copy episomal vector designed for inducible 
expression of recombinant proteins in S, cerevisiae. It carries GALl promoter and CYC I 
terminator sequences which regulate transcription, and 2\i origin of replication and URA3 

2 5 gene for maintenance and selection in the host strain. The plasmid pTNS 1 8 was digested with 

Hindm and the released 3.95 kb fi-agment containing the expression cassette for HFBI-FLOl 
was purified fi-om agarose gel and ligated to pYES2 digested with HindlH. This ligation 
mixture was concentrated by standard ethanol precipitation. The ligation mixture should 
contain besides unligated firagments and uncorrect ligation products also molecules where the 

3 0 vector and insert are correctiy ligated with each other to result in plasmid pTNS23 (Fig. 16) 

which carries the expression cassette for HFBI-FLOl operably linked to GALI and CYCI 
terminator sequences. 
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The above ligation mixture was transformed using the LiAc method of Gietz et ai (1992) 
into a laboratory cerevtsiae strain H452 (wild type W303-1A; Thomas and Rothstein, 
1989). Transformant colonies able to grow on SC-URA plates were picked and streaked on 

5 selective plates. NitroceUtdose replicas were taken from the plates and treated for colony 
hybridization according to Sherman et al (1983). To find those yeast colonies containing the 
pTNS23 plasmid, replicas were hybridized with digoxigenin labelled hfbl coding fragment 
after which an immunological detection was performed all according to the manufacturer 
(Boehringer Mannheim). Piasmids were recovered from several yeast colonies giving positivie 

0 hybridization signal by isolating total DNA and using this in electroporation of E. coli 
Restriction mapping and sequencing were carried out to confirm that the pTNS23 piasmid in 
the yeast transformants was correct. One of the transformants carrying piasmid pTNS23 was 
chosen for fiirther studies and was designated VTT-C-993 15. The control strain for it is yeast 
strain H2 155 which carries the piasmid pYES2 in H452 background. 

5 

Example 22 

Separation of yeast cells expressing HFBI-Flol fusion protein in ATPS 

The Saccharomyces cerevisiae strain VTT-C-993 15 (vector pTNS23) and its control strain 
0 H2155 (vector pYES2) were cultivated on synthetic complete medium lacking uracil (SC- 
URA) (Sherman, 1991) with 2% galactose as the carbon source to give an A^oa of 
approximately 4. Approximately 6.3 x 10* cells in their culture medium were taken to ATPS 
using 7% (w/v) C12-1 8E05 detergent (Agrimul NRE from Henkel) in a total volume of 5 ml 
ATPS was carried out using strandard protocol After phase separation by gravity settling, 
5 the top detergent phase was clearly turbid in the case of the strain VTT-C-993 15 in contrast 
to the control strain whose detergent phase was clear (Figure 17). Samples were taken from 
the top phases and dilution series from 10"^ to 10"^ were prepared in 0.9% NaCl and plated on 
YPD plates. After incubation at 30^C the amount of yeast colonies were calculated showing 
at least 70 times more yeast colonies of the strain VTT-C-993 18 on YPD plates compared to 
0 the control strain. This clearly manifests that also in a system over-loaded with cells, 
separation to the detergent phase of cells expressing a hydrophobin on cell surface occurs. 
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Example 23 

Partitioning of EGIcore-HFBI fusion protein in hydrophobin-containing and non- 
containing pure systems 

5 The efifect of free HFBI and HFBII hydrophobins on the partitioning of EGlcore-HFBI 
was investigated by comparing how efficiently a sample of purified EGIcore-HFBl in 50 
mM acetate buflfer pH 5.0 was extracted to the detergent phase in the presence and 
absence of purified HFBI and HFBIL The extraction was followed by measuring the 
depletion of hydrolytic activity on a soluble substrate such as methylumbelliferyi 

0 cellobioside in the aqueous phase before and after extraction. When 0.7 g/1 of either 
purified hydrophobin was present in the separation of 0.02 g/1 of EGIcore-HFBI with 2% 
C12-18E05 at 30X the extraction was affected in the following way: 93% of the 
protein was extracted when additional hydrophobin was not present, but with HFBII 
present 82% was extracted and with HFBI present 88% was extracted. 

5 

Example 24 

Production of EGIcore-HFBI fusion proteins in T. reesei tshft>2 strain for improved 
partitioning of the fusion protein in ATPS 

^ 

0 Trichoderma reesei strain QM9414 Hahfbl (VTT-D-99726) was transformed essentially as 
described (Penttila etal, 1987) using 10 |xg of the piasmid pMQ113 (described in Example 
1) together with 3 |iig of the selection piasmid pTOC202 containing the amdS gene (Hynes et 
al, 1983); Tilbum et al, 1983) of Aspergillus nidulcms encoding for acetamidase. pMQl 13 
contains an expression cassette for production of EGIcore-HFBI fusion protein under the 

5 control ofcbhl promoter and terminator sequences. 

The Amd-f- transformants obtained were streaked two times onto plates containing acetamide 
(Penttila et al^ 1987). Thereafter spore suspensions were made from transformants grown on 
Potato Dextrose agar (Difco). The production of the EGIcore-HFBI fusion protein was 
0 tested by slot blotting or Western analysis with EGI and HFBI specific antibodies from shake 
flask or microtiter plate cultivations carried out in minimal medium supplemented with Solka 
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flock cellulose. The spore suspensions of the clones producmg fusion protein were purified to 
single spore cultures on selection plates containing acetamide. To determine the best 

producers, production of the fusion protein was analyzed again fi*om these purified clones as 

described above, 

5 

For partitioning e5q)eriments of the EGIcore-HFBI fi:sion protein in ATPS using the 
polyoxyethyiene detergent C12-18E05 the best production strain obtained in this study and 
as control strains VTT-D-98691 (QM9414 strain producing EGIcore-HFBI) and VTT-D- 
74075 (QM9414) were cultivated in shake flasks on medium containing Solka flock cellulose 
10 as described in Example 4. 

Standard partitioning experiments as described in Example 5 are carried out with culture 
supematants. After separation the volume ratio of the lighter and heavier phase is noted and 
the concentration fector for the fusion protein is calculated from it. Samples are also taken 
1 5 fi-om the fighter and heavier phase and analysed with SDS-PAGE, Western blotting and 
activity measurements as described in Example 5. Partition coefficients (K) and yields (Y) are 
calculated as described in Example 5. 



Example 25 

Construction of vectors for expression of EGIcore^BmBH and EGIcore-SC3 fusion 
proteins under the cbhl promoter of T. reesei 



For construction of an expression vector for production of EGIcore-HFBII fiision protein, 
hfb2 (SEQ ID 20 ) coding region (fi-om Ala- 16 to the STOP codon) was amplified by PCR 

2 5 using phfb2 (Nakari-Setala et al. 1 997) as a template and as a 5' primer CGG AGGAGC 

TCG ACG ACT TCG AGC AGC CCG AGC TGC ACG CAG GCT GTC TGC OCT 
ACC GG (SEQ ID 21) and as a 3' primer TCA TTGGATCCT TAG AAG GTG CCG 
ATG GC (SEQ ID 22). The sequence is bold in the 5' primer encodes for amino acids 413- 
425 of EGI and the underlined GAGCTC is a Sad site. The underlined GGATCC in the 3' 

3 0 primer is a BamHI site. The ampUfied fi-agment was digested with Sad and BamHI and 

ligated to pMQl 13 cut with the same restriction enzymes. The resulting plasmid is pTNS32 
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(Figure 18) and it carries the coding sequence for EGIcore-HFBII fusion protein under the 
control of cbhJ regulatory sequences (SEQ ID 5 and SEQ ID 7). 

For construction of an expression vector for production of EGIcore-SC3 fusion protein, SC3 
5 cDNA (SEQ ID 23) was ampified with PGR using cSC3/pUC20 plasmid as a template and as 
a 5' primer ACT ACA CGG A GG AGC TC G ACG ACT TCG AGC AGC CCG AGC 
TGC ACG CAG GOT GGC GAG CCG GGC (SEQ ID 24) and as a 3' primer TCG TAC 
GGATCC TCA GAG GAT GTT GAT GGG (SEQ ID 25). The sequence is bold in the 5' 
primer encodes for amino acids 413-425 of EG! and the underlined GAGCTC is a Sad site. 
1 0 The underlined GGATCC in the 3' primer is a BamHt site. The amplified fragment was 
digested with Sad and Bamffl and ligated to pMQ103 (described in Example 1) cut with the 
same restriction enzymes. The resulting plasmid is pTH4 (Figure 19) and it carries the coding 
sequence for EGIcore-SC3 fusion protein under the control of cbhl regulatory sequences 
(SEQ ID 5 and SEQ ID 7). 

15 

The cSC3/pUC20 plasmid contains the 41 1 bp SC3 cDNA from the translational start site to 
the STOP codon in a pUC20 vector. The translational start site has been constructed to a 
Ncol site and a BamHI site has been added after the translational STOP codon. 

20 Example 26 

Construction of a vector for expression of a HFBI-dCBD fusion protein containing 
hydrophobin I and double cellulose binding domains (CBD) under the cbhl 

promoter of T, reesei 

2 5 For construction of an expression cassette for production of HFBI -dCBD fusion protein 
under cbhi promoter, the protein coding region of hjbi was amplified with PGR using 
pEAlO (Nakari-Setaia et al. Eur. J. Biochem. (1996) 235:248-255) as a template. GGA 
ATT CCG CGG ACT GCG CAT CAT GAA GTT GTT GGG GAT GGC C (SEQ ID 
26) was used as a 5' primer in the PGR and TGA ATT C CA TAT G TT AGG TAC CAC 

30 CGG GGC CCA TGC CGG TAG AAG TAG AAG CCC CGG GAG CAC GGA GGG 
CGG TGT GGC AG (SEQ ID 27) as a 3' primer. The sequence in bold in the 5' primer 
is 1 6 bp of cbhJ promoter adjacent to translational start site of the corresponding gene 
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and the underlined CCGCGG is a Kspl site. The underlined and bold sequences in the 3 ' 
primer are Ndel and Asp718 sites, respectively. The sequence in Italics in the 3' primer 
encodes for a Methionine-containing linker (PGASTSTGMGPGG). The resulting 
fragment of 370 bp was digested with Kspl and Ndel and ligated to pAMHllO 
5 (Nevalainen, K,M.H., Penttila, M,E., Harkki, A., Teeri, T.T. and Knowles, J. (1991) In 
Molecular Industrial Mycology. Eds. Leong, S.A. and Berka, R.. Marcel Dekker. New 
York) digested with same restriction enzymes. The resulting plasmid is pTNS29-2Asp. 

For further cloning steps, an Asp? 18 site was removed from the pTNS29-2Asp 
0 polylinker. The vector was digested with Sad and BamHI and the cleaved vector ends 
were blunted with T4 DNA polymerase and ligated together. The resulting vector 
pTNS29 is missing the Sad, Asp? 18 and Smal sites present in the pTNS29-2Asp 
polylinker. 

5 In the next step, a double ceilose binding domain (dCBD) was amplified with PGR using 
pTNSll as a template. pTNSll carries the coding sequence for a flision protein 
containing 7! reesei CBHII CBD (41 N-terminai residues of CBHII) and CBHI CBD 
linked together via CBHI linker region (CBHI linker and CBD are the last 57 residues in 
CBHI). This DNA sequence originates from an E. coli expression vector B599 which is 

0 essentially the same as the one described by Under et al. (J. BioL Chem. (1996) 
271:21268.21272). TGA ATT CGGTACCCA GGC TTG CTC AAG CGT C (SEQ ID 
28) was used as a 5' primer in the PGR and TGA ATT CCATATGTC ACA GGC ACT 
GAG AGT AGT A (SEQ ID 29) as a 3' primer. The underlined sequences in the 5' and 3' 
primers are Asp? 1 8 and Ndel sites, respectively. The amplified fragment was digested 

5 with Asp? 18 and Ndel and ligated to pTNS29 digested with Asp? 18 and Ndel resulting 
in pTNS30. 

pTNS30 (Figure 20) expression vector thus carries the coding region for a fusion protein 
consisting of HFBI and double CBD linked in frame via the Methionine linker peptide 
0 (PGASTSTGMGPGG). Expression of the fusion protein is regulated by the cbhl 
transcriptional control sequences. The expression cassette may be released from the 
plasmid with EcoRl and SphI 



I 
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Example 27 

Construction of vectors for expression of HFBI-single chain antibody fusion proteins 
under the cbh] promoter of T, reesei 

5 An expression vector was constructed for production of a fusion protein consisting of T 
reesei HFBI protein in the N-terminus and in the C-terminus a single chain antibody 
recognizing a small molecular weight derivative of diarylalkyltriazole (ENASScFv). 
Production of the fiision protein is under the cbhl regulatory sequences. For construction of 

1" *\ 
I » <l 

t» ■* 1^ 

HFBI-ENA5ScFv fusion protein, pENA5ScFv was digested with Ncol and Xbal. The 

*1 rt# 

'1 ■ 

■* -f 

W 1 0 fragment containing the emSscfv gene and the histidine tail (6 x His) was blut-end cloned to 

■«* 
I I 

f;| pTNS2$ resulting in pTHl (Figure 21). pENASScFv vector carries the coding region for 

W . ENA5 single chain antibody consisting of the variable domains of the heavy and light chains 

'< *y 

I! connected via a glycine serine linker and a 6 x histidine tag at the C-terminal end. 
f Transcription and secretion of the single chain antibody are under control of the tac promoter 

fti 15 mdpelB signal sequence, respectively (Takkinen et al, 1991). pTNS29 vector carries the 

t\ ^ rr' 

coding region of T reesei followed by a linker sequence 
l": (ProQlyAlaSerThrSerThrGlyMetGlyProGlyGly) under the control of cbhJ promoter and 

terminator sequences. 

2 0 For construction of HFBI-ENA5ScFv fiision protein with a thrombin cleavage site in the 
linker peptide, enaSscfv coding region (from Ala-23 to the STOP codon) and a peptide linker 
containing the thrombin cleavage site (Gly Thr Leu Val Pro Arg Gly Pro Ala Glu Val Asn 
Leu Val) preceeding it was amplified with PGR using pENASScFv as a template and as a 5' 
primer GAA TTC GGT ACC CTC GTC CCT CGC GGT CCC GCC GAA GTG AAC 

2 5 CTG GTG (SEQ ID 30) and as a 3' primer TGA ATT CCA TAT G CT AAC CCC GTT 

TCA TCT CCA G (SEQ ID 31). The sequence in bold in the 5' primer encodes the first 6 
N-terminal residues of ENASSCFV. The sequence in italics is a thrombin cleavage site and 
underlined GGT ACC is an Asp718 site. The sequence in bold in the 3' primer encodes the 6 
C-terminal residues of ENASScFv and the underlined CA TATG is a Ndel site. The 790 bp 

3 0 PGR fragment was purified from agarose gel and ligated to pTNS29 resulting in pTH2 

(Figure 22). 
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Example 28 

Construction of a vector for production of class I hydrophobin SC3 of ScMzophyllum 
commune in Tnchoderma reesei for ATPS 

5 AT. reesei strmn producing class I hydrophobin SC3 of S, commune was constructed. For 
that purpose an expression vector carrying the SC3 cDNA under the control of hfb2 
promoter and hfbl terminator of T, reesei was constructed. 

The hfbl terminator (SEQ ID 32) was amplified with PGR using as a template pEAlO 
1 0 (Nakari-Setala et al. 1996) and as a 5' primer GAC CTC GAT GCC CGC CCG GGG TCA 
AG (SEQ ID 33) and as a 3' primer GTC GAC ATT TCA TTT TAC CCC CCT CG 
(SEQ ID 34). The underlined sequences in the 5' and 3' primers are Sad and Sail sites, 
respectively. The PCR fragment was cut with Sad and Sail so that the Sad site was 
blunted with Klenow. The fragment was cloned into cSC3/pUC20 (described in Example 
1 5 25) vector digested with Sail and BamHI where the BamHI was blunted with Klenow. In 
the next step, the hfbl promoter (SEQ ID 35) was amplified with PCR using as a template 
pTNS8 (Nakari-Setala et al. 1997) and using as a 5' primer AAG CTT GCA TGC CTG 
CAT CC (SEQ ID 36) and as a 3' primer CCA TGG TGA AAG GTG GTG ATG GTT 
GG (SEQ ID 37). The underlined sequences in the 5' and 3' primers are Hindlll and 
20 Ncol sites, respectively. The PCR fragment was digested with Hindlll and Ncol and 
cloned in front of the SC3 cDNA in the piasmid obtained in the previous step and cut 
with the same restriction enzymes. The resulting piasmid is pKS2 (Figure 23). 

Example 29 

25 Construction of T reesei strains producing EGIcore-HFBlI, EGIcore-SC3, BFBl- 

dCBD and HFBI-single chain antibody fusion proteins, and SC3 hydrophobin 

Tnchoderma reesei strains VTT-D-74075 (QM9414X VTT-D-86271 (Rut-C30) and VTT- 
D-99676 (Rut-C30 hhfbl) were co-transformed essentially as described (Penttila et al, 
3 0 1 987) using 3-13 |xg of the plasmids pTNS32, pTHl, pTH2, pTH4, pTNS30 and pKS2 and 
as the selection plasmids 1-3 jug pToC202 or pARG21. pToC202 carrying the amdS gene 
(Hynes etal, 1983; Tilbum et al, 1983) of ^. nidulans and pAR021 carrying the£. coli hph 
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gene have been described in Example 3. The Anid+ and Hyg+ transformants obtained were 
streaked three times onto plates containing acetamide and hygromycin, respectively (Penttila 
et al, 1987), Thereafter spore suspensions were made from transformants grown on Potato 
Dextrose agar (Difco). 

5 

The production of the fusion proteins EGIcore-HFBII, EGIcore-SC3, HFBI-dCBD and 
HFBI-ENA5ScFv and the SC3 hydrophobin was tested by slot blotting or Western analysis 
with EGI, SC3, CBD and JIFBI specific antibodies fi'om shake flask or microtiter plate 
cultivations carried out in minimal medium supplemented with either lactose or Solka flock 
10 cellulose. The spore suspensions of the fiision protein producing clones were purified to 
single spore cultures on selection plates (containing either acetamide or hygromycin). To 
determine the best producers, production of the fiision proteins was analyzed again firom 
these purified clones as described above. 

15 T reesei strains selected for fiirther cultivations are X46A (pTNS32, host QM9414), VTT- 

D-00793 (pTH4, host Rut-C30 ^hjb2\ VTT-D-99727 (pTNS30, host Rut-C30 ^hfb2\ 

VTT-D-00791 (pTHl, host Rut-C30) and VTT-D-00792 (pKS2, host Rut-C30 hhjhl) 

These strains were cultivated as described in Example 4 and the culture supematants were 
subjected to ATPS. 

20 

Example 30 

Recovery of the protein product from the enriched detergent phase after ATPS by 
isobutanol or other solvents 

25 After phase separation has occurred and the hydrophobin or the hydrophobin fiision 
protein is enriched in the detergent phase (the enriched phase), the protein can be 
recovered to an aqueous buffer by addition of isobutanol or other solvent. For example a 
series, of identical extractions of 0.05 g/1 HFBI in 50 mM acetate buffer was made with 
2% Berol 532. To separate tubes 10% final concentration of either isobutanol, n-amyl 

3 0 alcohol, octanol, or octane was added, and the fi-action of hydrophobin recovered to the 
aqueous phase was analyzed. It was found that 100% was recovered with isobutanol, 



wo 00/58342 



42 



PCT/FTOO/00249 



89% with n-amyl alcohol, 81 % with octanol and 70 % with octane. Fractions were 
analyzed by HPLC as described in Example 38. 

5 Example 31 

Separation of EGIcore-HFBII fusion protein in ATPS. 

T.reesei strain X46A was grown in shake flasks on 3% lactose medium as described in 
Example 4. C12-18E05 detergent was added to a final concentration of 5% to the 
culture supernatant. After mixing the system was allowed to settle and the enriched 

1 0 detergent phase was subjected to fiirther extraction with an equal volume of isobutanoi 
After buflfer exchange on Biogel P-6 (Bio-Rad, USA) gel filtration media, the extracted 
protein was analysed by ion exchange chromatography using a Mono Q (Amersham 
Pharmacia, Sweden) column equilibriated with 1 5 mM acetate and eluted using a linear 
gradient of NaCL Separation of the EGIcore-HFBII fusion protein to the detergent phase 

1 5 was verified by analyzing the eluted peak fraction from the Mono Q chromatography run 
by its activity on 4-methylumbellifefyl cellobioside as well as its re-extraction by ATPS. 

Example 32 

Separation of EGIcore-SC3 fusion protein in ATPS 

20 

Culture filtrate of Zreesei strain VTT-D-00793 producing EGIcore-SC3 fusion protein 
was subjected to APIS by adding S% final concentration of CI2-18E05. After the 
settling, the phases were separated and the fiision protein enriched detergent phase was 
fiirther treated with an equal volume of isobutanoi which removes the detergent to the 
25 isobutanoi and leaves the protein in the aqueous phase. The resulting fusion protein 
enriched aqueous phase was then desalted on Biogel P-6 (Biorad, USA). Separation of 
fusion protein to the detergent phase was verified by analyzing the en2ymatic activity of 
the EGIcore fusion partner on 4-methylumbeiIiferyl cellobioside in that phase. 

30 
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Example 33 

Separation of HFBI-dCBD fusion protein in ATPS 



Treesei strain VTT-D-99727 producing HFBI-dCBD fusion protein was grown on 
5 lactose-containing medium as described in Example 4. Berol 532 was added to a final 
concentration of 2 % to 500 ml of culture supernatant. The mixture was allowed to settle 

and the phases separated in a separation funnel The enriched top phase (10 ml) was 
extracted with an equal volume of isobutanol and 50 mM acetate buffer pH5, The fusion 
protein was selectively enriched to the buffer in the separation as shown by SDS gel 
1 0 electrophoresis (Figure 24). 



Example 34 

Recovery of the HFBI-dCBD fusion partners after chemical cleavage 

15 HFBI-dCBD protein produced by the strain VTT-D-99727 has a methionine 
(PGASTSTGMGPGG) designed in the linker region between the HFBI and the dCBD, 
which would enable the recovery of the native HFBI and dCBD after chemical cleavage with 
CNBr. 

2 0 The HFBI-dCBD fusion protein was purified as described in Example 33 . The resulting water 
phase (-108 ml) still contained a small amount of CBHI and free hydrophobin in addition to 
the HFBI-dCBD fusion (Figure 24). Further purification of the sample was performed by 
chromatography. The sample was desalted on a Biogel P-6 column equilibrated with 50mM 
sodium acetate buffer pH 5.5, diluted 1 + 3 with water and applied on a CM-Sepharose FF 

2 5 column equilibrated with lOmM sodium acetate buffer pH 5.5. The CBHI protein was found 

in the flow through and the purified HFBI-dCBD protein was eluted with 0.2M NaCl. 

A sample of purified HFBI-dCBD protein was evaporated in speed-vac almost to dryness. 
CNBr cleavage was performed by adding a solution of CNBr in 0. IM HCl (5mg/ml) to the 

3 0 evaporated sample in 5 times weight excess. The reaction was allowed to continue for 24h at 

room temperature (in dark). A 10 times volume of water was added and the sample was 
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evaporated in speed-vac almost to dryness. The evaporated sample was diluted in 50mM 
Tris-HCl pH 7 and the cleavage was confirmed by HPLC analysis as described in Example 
37. 

5 The treatment was further optimised by using different concentrations of HCl and incubation 
times. The final CNBr treated sample was subjected to ATPS as described in Example 33. 
HFBI was found in the top-phase and the dCBD in the bottom phase, thus leading to a 
separation of both fusion partners. 

1 0 Example 35 

Separation of HFBI-single chain antibody fusion protein in ATPS 

Strain VTT-D-00791 was cultivated for 7 days ^ on medium containing 10 g/\ potassium 
phtalate, 15 g/l KH2PO4, 5 g/L (NH4)2S04 and 3% lactose and 0.2% peptone. 30 ml of 
15 culture medium was subjected to ATPS using 4% C12-18-E05 detergent as described in 
Example 5. After removal of the bottom phase, the detergent phase was extracted by 
isobutanol. Proteins fi-om the water phase after detergent extraction and the bottom phase of 
the isobutanol extracted detergent phase were precipitated with trichloroacetic acid (final 
concentration 10%), resuspended in SDS-PAGE sample buffer (taking into acount the 

2 0 concentration factor of the isobutanol extracted detergent phase) and analysed by Western 

blotting using HFBI specific antibodies. The fusion protein runs as a dimer (approximately 70 
kDa ) in SDS-PAGE under the conditions used as seen in Figure 25. The water phase sample 
stili contained a small amount of fiision protein. Nevertheless, separation of the ^sion protein 
can be obtained by ATPS. 

25 

Example 36 

Separation of SC3 hydrophobin in ATPS 

SC3 hydrophobin producing strain VTT-D-00792 was cultivated in shake flasks on laaose- 

3 0 containing medium as described in Example 4. Soluble secreted proteins produced by the 

fimgus were precipitated with trichloroacetic acid. Precipitated proteins were solubiiized in 
trifluoroacetic acid to dissociate the SC3 aggregates whereafter the acid was evaporated with 
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a stream of air. The treated proteins were solubilized in 2 ml water containmg 2.5 % of the 
detergent C12-18E05. Phase separation was carried out as described in Example 5. Samples 
were taken from the lighter and heavier phase and analysed by Western blotting using SC3 
specific antibodies. On the basis of this analysis, SC3 hydrophobia partitions to the detergent 
5 phase (Figure 26) . 

Example 37 

Separation of nisin in ATPS 

10 1 mg of pure nisin (Sigma) (equals to 1000 lU) in 50 mM sodixm acetate buffer pH 5 was 
subjected to ATPS using 2% of the detergent C12-18E05 at 30°Q and the phases were 
allowed to settle. After removal of the bottom phase, the detergent phase was extracted by 
isobutanol which removes the detergent to the isobutanol and leaves nisin in the aqueous 
phase. Separation of nisin to the enriched aqueous phase was assayd testing nisin 

15 bioactivity as described by Qiao et al. 1996. Comparison of the halos on the assay plate 
produced by the enriched phase and the nisin controls indicated that nisin had separated 
to the detergent phase with a concentration factor of approximately 5. 

2 0 Example 38 

Preparative purification of HFBI and HFBn in ATPS 

For preparative purposes, HFBI and HFBI were extracted from 0.5 1 of culture 
supemantants from cultivations carried out on media containing glucose or cellulose as 

2 5 described in Example 4. 2% (w/w) detergent was added to the culture supernatant which 
was mixed and then allowed to settle in a separation funnel at 20^C if C11E02 (Berol 
532) was used and SO'^C if CI2-18E05 (Henkel) was used. The detergent phase 
(enriched phase) was collected and mixed with an equal volume of isobutanol. When 
C1IE02 was used an equal volume of 50 mM acetate buffer was also added. The 

30 remaining culture supernatant is the depleted phase. To follow the purification, analytical 
HPLC was run on samples from each step (Figure 27and 28). On the basis of HPLC 
analysis, both HFBI and HFBII partitioned well to both of the detergents. When 
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C11E02 was used, K values of >1000 and >78 were obtained for HFBI and HFBII, 
respectively. 

Water phase after isobutanol extraction was collected and furher purified on a Vydac 
5 1x20 cm semipreparative C4 column (Vydac, USA) eqilibriated with 0.1% trifluoro 
acetic acid in water and eluting with a linear gradient of 0.1% trifluoro acetic acid in 
acetonitrile. The proteins were then lyophilized. In analytical runs a 4,6 mmx20 cm 
Vydac C4 column was used. 

1 0 Example 39 

Screening for separation conditions using purified HFBI and HFBII 

HFBI and HFBII which were purified from culture superaatants in ATPS, further 
purified by preparative HPLC and lyophilized as described in Example 38 were dissolved 

15 in water to a concentration of 0.5 mg/ml. The stock solutions were then diluted with the 
buffer appropriate for the experiment. 50 mM acetate buffer pH5, 50 mM glycine pH 3, 
50 mM HEPES pH 7 was used for pH screening and NaCl or (NH4)2S04 in acetate 
buffer was used for ionic strength experiments. The surfactant was weighed with the 
appropriate amount of hydrophobin solution and incubated in 0, 20, 30, 40°C and the 

20 amount of hydrophobin in the depleted phase determined. Partitioning coefficients were 
calculated from the change in concentration and the volume ratios and are presented in 
the tables below. 

25 



30 
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Table presenting K-values for the separation of hy drophobins with different surfactants 
and temperature. 





Temperature 


Henkel 


Beroi 532 


Berol 266 


Triton xll 4 


HFBI 


OX 


0,1 


368 


NP 


NP 








1961 


NP 


NP 




30^C 


85 


2970 


NP 


0,9 




40X 


148 


682 


28 


69 


HFBn 


0°C 


NP 


76 


NP 


NP 




20X 


1,0 


139 


NP 


NP 




30X 


67 


194 


NP 


4,0 




40X 


44 


102 


19 


44 



5 NP = no phase separation 



Table presenting the effect of different salt concentrations at 40*^C on K-values 

10 







Henkel 


Berol 532 


Berol 266 


Triton xl 14 


HFBI 


50 rriM buffer 


244 


578 


13 


39 




+ 1 MNaCI 


239 


33 


198 


241 




+1 M {NH4)2S04 


0,9 


3,2 


176 


612 














HFBII 


50 mM buffer 


36 


80 


16 


58 




+ IMNaCl 


14 


87 


61 


38 




+1 M (NH4)2S04 


3,1 


351 


95 


35 



Table presenting the K-values for the partitioning of hydrophobias in Berol 532 at 
1 5 different conditions. 





HFBI OT 


HFBI 20X 


HFBE O^C 


HFBII 20°C 


50 mM acetate pH5 


298 


614 


76 


127 


+lMNaC} 


6i4 


3781 


131 


199 


+1M (NH4)2S04 


931 


44 


203 


139 


pH3 


614 


1881 


73 


160 


pH7 


361 


614 


54 


126 
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Example 40 

Purification of a new putative T. reesei hydrophobin on the basis of its separation 
ATPS, and cloning of the corresponding gene 



5 7. reesei strain VTT-D-99726 (QM9414 bJifbl) was cultivated on lactose-based 
medium in 15 fermentor as described in Example 4. After cultivation, H of the culture 
filtrate was subjected to ATPS with 5% polyoxyethylene detergent C12-18E05, After 
phase separation at 30°C, detergent phase was separated from the depleted aqueous 
phase and analysed by SDS-PAGE. The analysed sample contained a protein which 

0 was about 7,5 kDa. In addition, some larger proteins were present. These were 
removed by hydrophobic interaction chromatography on a Phenyl-Sepharose FF 
column equilibriated with 50 mM sodium acetate - 1 M (NH4)2S04 buffer. The 
proteins were eluted with descending (NH4)2S04 salt gradient. Fractions containing 
the ca. 7.5 kDa protein were pooled and concentrated, and analysed by mass 

5 spectrometry. This revealed that the purified protein was degraded to three peptides 
(2486, 2586 and 2709 Da), which were held together by disulphide linkages. The N- 
terminal amino acid sequence from the 2486 Da peptide was 
ANAFCPEGLLYTNPLCCDLL, which on the basis of position of cysteines and on 
sequence comparison to known hydrophobins is typical to a hydrophobin. 

0 

Degenerated primers were designed on the basis of this amino acid sequence as well 
as on the basis of the sequence obtained from the 2586 Da peptide. These primers 

were used m RT-PCR (RobusT RT-PCR Kit, Finnzymes) with RNA isolated from the 
same cultivation from where the protein was purified. The 140 bp fragment obtained 
5 from the RT-PCR reaction was sequenced. The obtained sequence contained a part 
that was coding for the 2486 Da peptide used to design the PGR primer confirming 
that the PGR product corresponds to the purified protein. 
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The strain VTT-D-99702 producing EGIcore-HFBI fusion under the chhl promoter was 
cultivated in pilot scale fermentor in 1 200 L on minimal medium containing 4% lactose, 
0.4 % peptone and 0.1% Yeast Extract for 4 days. Cultivation temperatures were 28''C 
or alternatively the temparature was changed during the cultivation from 27°C to 22°C in 
a step-wise mamier. Production level of the fusion protein was a few grams per liter 
After the end of the cultivation, the mycelium was separated by means of a rotating 
vacuum drum filter using Celite 535 diatomaceous earth as filter aid. 

In separation 1, 1100 L of the supernatant was transferred into the meanwhile cleaned 
bioreactor, adjusted to the separation temperature of 24,7 and mixed with salt and 
detergent to give 0J5 M ammonium dihydrogen phosphate and 4,1 % of detergent C12- 
18E05. The phases were left to separate by gravity settling and the heavier phase 
removed through the bottom valve. A parallel experiment in 10 ml volume was carried 
out to investigate the effect of scale up on the separation of the fiision protein. The 
partition coefficient and the concentration factor were equal for the 10 ml and 1200 L 
separation within the measurement error. The separated detergent phase was re- 
extracted using the detergent phase from the first extraction by replacing half of the 
volume of the bottom phase by tap water. The salt concentration was adjusted to 0,25 M 
ammonium dihydrogen phosphate at 30°C. K and Y values of the separations are 
presented in the below Table. 

Alternatively in separation 2, 10 mi of drum filtrated culture medium samples were 
subjected to ATPS at 25°C using 4.1% C12-18E05 detergent and 1.15 M {NH4)H2P04. 
K and Y values after one separation step are presented in the Table below. 
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Table presenting extraction experiments of culture supernatants from different pilot scale 
cultivations 



Separation 


Cultivation 
temperature 


Concentration factor 


K 


Y [%] 


Separation 1 


2TC 22°C 


3,4 


3.8 


59 


Re- extraction 






8 


70 


Separation 2 


29^C 


2.8 


8.3 , 


81 



Example 42 

Separation of EGI, EGIcore-HFBI fusion protein and HFBI hydrophobin in robust 
micelle/polymer systems 

10 Separations were performed with purified samples. HFBI was purified by two-phase 
separation and HPLC as decribed in Example 38. EGIcore-HFBI was purified by two-phase 
extraction as decribed in Example 5 followed by desalting on a Biogel P-6 column 
equiliberated with 20niM sodium acetate puffer pH 6 containing 150mM NaCl. The purified 
proteins were subjected to ATPS using different micelle/polymer systems. 

15 

The table below presents partitioning coefficients of pure EGI, HFBI and EGIcore-HFBI 
at constant tie-line length, in the different systems, Triton X-114/water system (4.1 % 
wt.) at 31.5^ C, Triton X-lOO/Reppal/water system (8.1 % wt. / 8.2 % wt.) and Triton 
X-114/Reppal/water system (5.0 % wt. / 4.0 % wt.) at 2P C. The standard deviation is 
2 0 shown within the brackets. The systems were buffered with 25 mM sodium acetate, pH 

4.0. K > 1 is equivalent to a preferred protein partitioning to the micelle rich phase. 





EGI 


HFBI 




EGIcoi«-HFBI 




Kegi 


Khfb 


Kegihfb Yield (%) 

top pha.se 


TritonX-il4 


0.6 (0.1) 


21.5 (1.5) 


tA (1.0) 


85 


Triton X-iOO/Reppal 


0.9 (O.I) 




1.7 (0.1) 


56 


Triton X-114/Reppal 


0 7 (O.I) 


11.0 (1.1) 


15.4 (2.9) 


91 
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Claims 

1 . A process for partitioning of proteins or cells in aqueous two-phase systems (ATPS), 
comprising the steps of 

a) in order to obtain a fusion protein or cell, combining a protein or a cell of interest 
to a targeting protein selected from the group consisting of amphipathic and hydrophobic 
proteins having the ability to partition in ATPS and to carry said protein or cell of 
interest into one of the phases, and 

b) subjecting said fusion protein or cell carrying the targeting protein to an ATPS 
separation. 

2. The process according to claim 1, wherein the targeting protein is a hydrophobin or 
hydrophobin-like protein or a part thereof 

3 . The process according to claim 2, wherein the hydrophobin is a Trichoderma 
hydrophobin or a part thereof 

4. The process according to claim 3, wherein the Trichoderma hydrophobin is HFBI, 
HFBII or SRHI ,or a part thereof 

5. The process according to claim 1, wherein the amphipathic or hydrophobic proteins 

or parts thereof form aggregates. 

6. The process according to any one of claims 1 to 5 for partitioning cells in ATPS, 
wherein in step a) of claim 1 the combination of the cells of interest to the targeting 
protein comprise bringing said targeting protein onto the surface of said cells. 

7. The process according to claim 6, wherein the cells are yeast ceils. 

8. The process according to claim 6 wherein the cells are spores. 

9. The process according to any one of claims 6 to 8, wherein the targeting protein is 

fiised to a protein which brings the targeting protein onto the surface of the cell. 
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1 0. A fusion protein, comprising a hydrophobin or hydrophobin-like protein as defined in 
any one of claims 2 to 5 fused to a protein of interest. 

11. The fusion protein according to claim 10, wherein the protein of interest is a cell- 
bound protein or a part thereof 

12. The fusion protein according to claim 10, wherein the protein of interest is an 
extracellular protein or a part thereof 

13. The fusion protein according to claim 12, wherein the extracellular protein is an 
extracellular protein of Trichoderma, selected from the group consisting of cellulases, 
hemicellulases and proteases. 

14. The iusion protein according to claim 10, wherein the protein of interest is an 
antibody protein or a part thereof 

15. The process according to claim 1, wherein the targeting protein is fused to the 
protein of interest according to any one of the claims 10 to 14. 

16. A recombinant organism producing a fusion protein according to any one of claims 
10 tol4. 

17. The recombinant organism according to claim 16, wherein the organism has been 
genetically modified to be capable of producing a fusion protein according to any one of 
claims 10 to 14. 

1 8. A recombinant DNA molecule, comprising a DNA molecule encoding a fiasion 
protein according to any one of claims 10 to 14. 

19. A process for producing a fusion protein according to any one of the claims 10 to 14 
with recombinant organisms, comprising the steps of 

a) transforming the organism with DNA molecules enabhng expression of such proteins, 
and 
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b) recovering such protein from the culture of the recombinant organism. 



20. The process according to any one of claims 1 to 9 and 1 5, wherein the aqueous two- 
phase system is selected from the group consisting of PEG/salt, PEG/Dextran and 
PEG/starch systems or derivatives thereof, detergent-based aqueous two-phase systems 
and thermoseparatmg polymer systems. 

21 . The process according to claim 20, wherein the detergent-based ATPS comprises a 
detergent which is selected from the group consisting of nonionic and zwitterionic 
detergents. 

22. The process according to claim 20, wherein the thermoseparating polymer system 
comprises a polymer which is selected from the group consisting of polyethylene- 
polypropylene copolymers. 

23. The process according to any one of claims 1-9 and 15, wherein the protein or cell of 
interest is separated from a suspension containmg cells or cell extracts. 

24. A process for separating hydrophobins or hydrophobin-like proteins or parts thereof 
in aqueous two-phase systems, comprising the steps of 

a) mixing solutions containing said hydrophobin, hydrophobin-like protein or parts 
thereof with the phase forming chemicals, and 

b) carrying out ATPS separation, 

wherein the aqueous two-phase system is as defined in any one of claims 20 to 22. 
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SEQUENCE LISTING 
<110> Valtion teknillinen tutkimuskeskus 
<120> Process for partitioning of molecules 
<130> 31805 
<140> 

<160> 42 

<170> Patentin Ver . 2.2 

<210> 1 

<211> 428 
<212> DNA 

<213> Trlchoderma reesei 
<220> 

<221> intron 
<222> (167) . . (236) 

<220> 

<22l> intron 

<222> (323) . . (386) 
<220> 

<223> Coding sequence of hfhl 
<400> 1 

atgaagttct tcgccatcgc cgctctcttt gccgccgctg ccgttgccca gcctctcgag 6 0 
gaccgcagca acggcaacgg caatgtttgc cctcccggcc tcttcagcaa cccccagtgc 12 0 
tgtgccacGc aagtccttgg cctcatcggc cttgactgca aagtccgtaa gttgagccat 180 
aacataagaa tcctcttgac ggaaatatgc cttctcactc ctttacccct gaacagcctc 24 0 
ccagaacgtt tacgacggca ccgacttccg caacgtctgc gccaaaaccg gcgcccagcc 3 00 
tctctgctgc gtggcccccg ttgtaagttg atgccccagc tcaagctcca gtctttggca 360 
aacccattct gacacccaga ctgcaggccg gccaggctct tctgtgccag accgccgtcg 420 
gtgcttga 42 8 



<210> 2 

<211> 78 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 2 

tcgggcacta cgtgccagta tagcaacgac tactactcgc aatgccttgt tccgcgtggc 60 
tctagttctg gaaccgca 70 
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<210> 3 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 3 

tcgtacggat cctcaagcac cgacggcggt 30 



<2lO> 4 
<21l> 63 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PCR 5' primer 

<400> 4 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcagag caacggcaac 60 



<210> 5 
<21l> 2211 

<212> DNA 

<213> TTxchodiexjaa. jreesel 
<220> 

<221> promoter 
<222> (1) . . (2211) 

<223> chhl promoter sequence 
<400> 5 

gaattctcac ggtgaatgta ggccttttgt 

acctccatta cgcctccccc atagagttcc 
tagattgggg agaagttgac ttccgcccag 
ggtcggcaac ggcaaaaaag cacgtggctc 
tccaggaacc tggatacatc catcatcacg 
tattcgccct aaaccgaagt gcgtggtaaa 
gtgtgtcttc tctaggtgca ttctttcctt 
tccgagctgt aactacctct gaatctctgg 
tgcatcatgt atataatagt gatcctgaga 
t99tcatcaa acaaagaacg aagacgcctc 
agaactggat acttgttgtg tcttctgtgt 
tctattcaaa caccaagctt gctcttttga 

agttgtgaag tcggtaatcc cgctgtatag 
ctgctgcgaa cccggagaat cgagatgtgc 
catgaaaggc tatgagaaat tctggagacg 
acaagcaaag cgttccgtcg cagtagcagg 
taagtagcga tggaaccgga ataatataat 
caatgcaggg gtactgagct tggacataac 
ggcgtttccc tgattcagcg tacccgtaca 
ggacgtgttt tgcccttcat ttggagaaat 
tgaccgactg gggctgttcg aagcccgaat 
aggcatgttg tgaatctgtg tcgggcagga 



sgggtaggaa ttgtcactca agcaccccca 60 
caatcagtga gtcatggcac tgttctcaaa 12 0 
agctgaaggt cgcacaaccg catgatatag 18 0 
accgaaaagc aagatgtttg cgatctaaca 24 0 

cacgaccact ttgatctgct ggtaaactcg 300 

tctacacgtg ggcccctttc ggtatactgc 36 0 
cctctagtgt tgaattgttt gtgttgggag 420 
agaatggtgg actaacgact accgtgcacc 48 0 
aggggggttt ggagcaatgt gggactttga 54 0 

ttttgcaaag ttttgtttcg gctacggtga 600 
atttttgtgg caacaagagg ccagagacaa 660 
gctacaagaa cctgtggggt atatatctag 720 
taatacgagt cgcatctaaa tactccgaag 78 0 
tggaaagctt ctagcgagcg gctaaattag 84 0 
gcttgttgaa tcatggcgtt ccattcttcg 900 
cactcattcc cgaaaaaact cggagattcc 96 0 
aggcaataca ttgagttgcc tcgacggttg 102 0 
tgttccgtac cccacctctt ctcaaccttt 1080 
agtcgtaatc actattaacc cagactgacc 1140 
aatgtcattg cgatgtgtaa tttgcctgct 12 00 

gtaggattgt tatccgaact ctgctcgtag 1260 
cacgcctcga aggttcacgg caagggaaac 1320 
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caccgatagc agtgtctagt agcaacctgt aaagccgcaa tgcagcatca ctggaaaata 1380 
caaaccaatg gctaaaagta cataagttaa tgcctaaaga agtcatatac cagcggctaa 144 0 
taattgtaca atcaagtggc taaacgtacc gtaatttgcc aacgcgttgt ggggttgcag 1500 
aagcaacggc aaagcccact tcccacgttt gtttcttcac tcagtccaat cccagctggt 1560 
gatcccccaa ttgggtcgct tgtttgttcc ggtgaagtga aagaagacag aggtaagaat 1620 
gtctgactcg gagcgttttg catacaacca agggcagtga tggaagacag tgaaatgttg 168 0 
acattcaagg agtatttagc cagggatgct tgagtgtatc gtgtaaggag gtttgtctgc 174 0 
cgatacgacg aatactgtat agtcacttct gatgaagtgg tccatattga aatgtaagtc 1800 
ggcactgaac aggcaaaaga ttgagttgaa actgcctaag atctcgggcc ctcgggcttc 1860 
ggctttgggt gtacatgttt gtgctccggg caaatgcaaa gugtggtagg atcgaca'^ac 1920 
tgctgccttt accaagcagc tgagggtatg tgataggcaa atgttcaggg gccactgcat 1980 
ggtttcgaat agaaagagaa gcttagccaa gaacaatagc cgataaagat agcctcatta 204 0 
aacgaaatga gctagtaggc aaagtcagcg aatgtgtata tataaaggtt cgaggtccgt 2100 

gcctccctca tgctctcccc atctactcat caactcagat cctccaggag acttgtacac 2160 
caccttttga ggcacagaaa cccaatagtc aaccgcggac tgcgcatcac g" 2211 



<210> 6 
<2ia> 1588 
<212> DNA 

<213> Trxchodezwa reesei 



<220> 

<223> T, reesei egll cDNA 



<400> 6 

cccccctatc ttagtccttc ttgttgtccc 
accacggcca tcctggccat tgcccggctc 
cccgaggtcc atcccaagtt gacaacctac 
caggacacct cggtggtccc tgactggaac 
tcgtgcaccg tcaacggcgg cgtcaacacc 
aagaactgct tcatcgaggg cgtcgactac 
agcctcacca tgaaccagta catgcccagc 
cggctgtatc tcctggactc tgacggtgag 
ctgagcttcg acgtcgacct ctctgctctg 
tctcagatgg acgagaacgg gggcgccaac 
agcggctact gcgatgctca gtgccccgtc 
agccaccagg gcttctgctg caacgagatg 
gccttgaccc ctcactcttg cacggccacg 
ccctatggca gcggctacaa aagctactac 
accttcacca tcatcaccca gttcaacacg 
agcatcaccc gcaagtacca gcaaaacggc 
gacaccatcc cgtcctgccc gcccgcctca 
gccctgagca gcggcatggt gctcgtgttc 
aactggctcg acagcggcaa cgccggcccc 
atcctggcca acaaccccaa cacgcacgtc 
gggtctacca cgaactcgac tgcgcccccg 

actacacgga ggagctcgac gacttcgagc 
cagtgcggtg gcattgggta cagcgggtgc 
tatagcaacg actactactc gcaatgcctt 
gacgggggca cgatagaatg cgggcacgca 
aagacatgct atgttgtatc tacattagca 
agcaaaaaaa aaaaaaaaaa aaaaaaaa 



aaaatggcgc cctcagttac actgccgttg 6 0 
gtcgccgccc agcaaccggg taccagC7r:c 120 
aagtgtacaa agtccggggg gtgcgtggcc 180 
taccgctgga tgcacgacgc aaactacaac 24 0 
acgctctgcc ctgacgaggc gacctigtggc 3 00 
gccgcctcgg gcgtcacgac ctcgggcagc 360 
agctctggcg gctacagcag cgtctctcct 420 
tacgtgatgc tgaagctcaa cggccaggag 48 0 
ccgtgtggag agaacggctc gctctacctg 54 0 
cagtataaca cggccggtgc caactacggg 6 00 
cagacatgga ggaacggcac cctcaacact 660 
gatatcctgg agggcaactc gagggcgaat 72 0 
gcctgcgact ctgccggttg cggcttcaac 780 
ggccccggag ataccgttga cacctccaag 84 0 
gacaacggct cgccctcggg caaccttgtg 900 
gtcgacatcc ccagcgccca gcccggcggc 960 
gcctacggcg gcctcgccac catgggcaag 1020 
agcatttgga acgacaacag ccagtacatg 1080 
tgcagcagca ccgagggcaa cccatccaac 114 0 
gtcttctcca acacccgctg gggagac . :t 1200 
cccccgcctg cgtccagcac gacgttttcg 1260 
agcccgagct gcacgcagac tcactggggg 1320 
aagacgtgca cgtcgggcac tacgtgccag 138 0 
tagagcgttg acttgcctct ggtctgtcca 144 0 
gggagctcgt agacattggg cttaatatat 150 0 
aatgacaaac aaatgaaaaa gaacttatca 1560 

1588 
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<210> 7 
<211> 745 
<212> DNA 

<213> Trichoderma reesei 



<220> 

<221> terminator 

<222> (1) . . (745) 

<223> r. reesei cbhl terminator 



<400> 7 

ggacctaccc agtctcacta cggccagtgc 
tgcgccagcg gcacaacttg ccaggtcctg 

tccgtgcgaa agcctgacgc accggtagat 
ggagctacat ggccccgggt gatttatttt 
atacggtcaa ctcatctttc actggagatg 
ttggcaaatt gtggctttcg aaaacacaaa 
taacggaata gaagaaagag gaaattaaaa 
cccgtagaat cgccgctctt cgtgtatccc 
caatgttgat attgttccgc cagtatggct 
cgaacgcggt agtggctgct gccaattggt 

gcaacagtgg aaattagtgg cgcaataatt 
tggaaagcac tgttggagac caacttgtcc 

acgatgacaa cgtagccgag gaccc 



ggcggtattg gctacagcgg ccccacggtc 60 
aacccttact actctcagtg cctgtaaagc 120 

tcttggtgag cccgtatcat gacggcggcg 160 
ttttgtatct acttctgacc cttttcaaat 24 0 

cggcctgctt ggtattgcga tgttgtcagc 3 00 
acgattcctt agtagccatg cattttaaga 360 
aaaaaaaaaa aacaaacatc ccgttcataa 420 

agtaccacgt caaaggtatt catgatcgtt 480 
ccacccccat ctccgcgaat ctcctcttct 54 0 
aatgaccata gggagacaaa cagcataata 600 
gagaacacag tgagaccata gctggcggcc 66 0 
gttgcgaggc caacttgcat tgctgtcaag 720 

745 



<210> 8 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 

<400> fl 
taaccgcggt 



<210> 9 
<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 
<400> 9 

ctagaccgcg gttaat 



<210> 10 
<2ll> 1232 
<212> DNA 

<213> Trichoderma reesei 
<220> 

<221> promoter 
<222> (1) . . (1232) 
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<22 3> T. jreesei gpdl promotor 
<400> 10 

gtcgacacga tatacaggcg cggctgatga 
stgtgacaat attgactgcg aggaaccatc 
caccaaggtg catgcatcat aaggattctc 
gttagtgaag gcaccgtgat ggcaagatag 
cagtacgtca agtaacgtca aagccgccct 
gattgcctca atcacatagt acctacctat 
agattgagag ctaccttaca tcaatatggc 
ccccagccgg cgcgattgtg tgtactaggt 
tgcttggcaa tcgtgctcag ctgtcaggtt 

gcagttgcca aggtacctag ggagggattc 
taggatggcg gctggcgggg ccgaagctgg 
aagttgatga taccgttttg ccagattaga 
gatgccgcat caccgttgtg tcagaaacga 
ccactatctc aggtaaccag gtactaccag 
gtgttaatct ttccacctcc ctcctcaatc 

cttcctcctc ttcttctcca taaccattcc 
agcctcgcaa atcctcagtt tgtatgtacg 
gcccgacgat gcggcttctg ttcgcctgcc 
gccccgccag gactctcctg cgtcaccaat 
tccctctcgt ttcttcctaa caaacaacca 

cacgcagctc aatccgcaga tacaaatcta 



taatgatgac cgagcatgac ttgatgctgt 6 0 
aggtgtgtac ggatggaacc attctgtaac 12 0 
ctcagctcac caacaacgaa cgatggccat 18 0 
aaccactatt gcatctgcgc ttcccacgca 24 0 
cccgtaacct cgcccgttgt tgctccc jc 3 00 
gcattatggg cggcctcaac ccaccccccc 360 
cagcacctct tcggcgatac atactcgcca 420 
aggctcgtac tataccagca ggagaggtgc 4 80 

gtacntgtat ggtacttgta aggtggtcat 540 
aacgagccct gcttccaatg tccatctgga 6 00 
gaactcgcca acagtcatat gtaatagctc 660 
tgcgagaagc agcatgaatg tcgctcatcc 72 0 
ccaagctaag caactaaggt accttaccgt 78 0 
ctaccctacc tgccgtgcct acctgcttta 84 0 
ttcttttccc tcctctcctc tttttttttt 900 
taacaacatc gacattctct cctaatcacc 960 
tacgtactac aatcatcacc acgatcgtcc 1020 
cctcctctca ctcgtgccct tgacgagcta 1080 
ttttttccct atttacccct cctccctctc 114 0 
ccaccaaaat ctctttggaa gctcacgact 1200 
5a 1232 



<210> 11 
<211> 1129 
<2l2i. DNA 

<213> Trichoderma reesei 

<220> 

<221> terminator 

<222> (1) . . (1129) 

<223> T, reesei gpdl terminator 

<400> 11 

ggatcccgag cattgtctat gaatgcaaac 
atgacgaata gagccaatct gctccacttg 
ccttgactgc ccattcaaac aattgtaaag 
gcgtgcgagc ccgtttgtac gttattttga 
cttggcttac gatcatgttt gcgatcttcg 
atcgcttcta tgattaggta ctcagacaac 
aggtcgattc caatcatgat ctggactggg 
agaaggaatg gtaggacctc tcaggcgtcc 
ccaaaagcac ctacctatct acctagagta 
agaatgatct caagaggccg tgaacgcaga 
aacaaaaaaa gaaagatgag gactagaaga 
tgaggtgacc tgtcagggtt taccgcaaac 
ccgcgcctcg cctcttctct cctgtccttt 
gaagcggggt tatcgacgcc accaaagatc 
atcaaacagc ccccgagtga tcagatccgt 
accgaggaac aataaagcga gtaaataaca 
gttcctcaat caatcccaac ccccctaaaa 
aggagagaga tatttccact accgctcacc 
gaatcagaag tgacaacaac gggttgagca 



aaaaatagta aataaatagt aattctggcc 6 0 
actatcttgt gactgtatcg tatgtcgaac 120 

gaatatagct acaagttatg tctcacgttt 18 0 
gaaagcgttg ccatcacatg ctcacagtca 24 0 
gtaagaatac acagagtaac gattatctcc 300 
acatgggaaa caagataacc atcgcatgca 360 
gtattccatc taagccatag taccctcv^ag 420 
accatctgtg ctgcaaatcc aagaaacccc 480 
actgcacgag aaaagaaaag gagcagaaga 54 0 
aacacactcc tcccaacttt tcaagttttg 600 
tggagtattt ccttcttaga gagctctcgg 660 
cgtcggtggt tctatccaat taatcaagtc 720 
catagaatcc cgtctccttg ttgcttgatc 780 
ttgtcttggt gacttatcaa tcctttggtg 84 0 
aaaagaagaa gaagagtacg atttaaccag 900 
tcaaaataag agtctcgttg aaaattactt 96 0 

gcccttcccc ccatggtata tcccggcagt 102 0 
accaagtgag gcttgccgag agaagaggat lOBO 
catgggatat cggcgcgcc 1129 
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<210> 12 
<211> 5733 
<2X2> DKA 

<213> Aspergillus nldulans 
<220> 

<223> (1-5733) Sequence of plasmid pAN52-l 
<220> 

<221> promoter 

<222> (1) , . (2129) 

<223> A. nidulaxis gpdA promoter 

<220> 

<221> gene 

<222> (2130) . . (2304) 

<22 3> A. nidulaxis gpdA gene 

<220> 

<22i> terminator 

<222> {2305) . . (3071) 

<223> A. nidulans trpC terminator 

<220> 

<221> misc_feature 

<222> (3072) . . (5726) 

<223> pUC18 from Sail to EcoRI 



<400> 12 

caattccctt gtatctctac acacaggctc 

cgtttatatc ttgcatcgtc ccaaagctat 
cttgaagtaa tctctgcaga tctttcgaca 
agcggcgagg agcctcgtcc tgtcacaact 
cgccagctca ttaagagcca gttcatgggc 
ttcaagtaca ccaacgctct tctgatccag 
tggttaagat ccacgtcttc gggaagccag 
ctgccaacag ctttctcagc cagggccagc 
ccgagaagaa ctggaggggt ggtgtcaagg 
cggagcggtg tcaagaggat attcttcgac 
ggaggtagca tagcttcatt tggatttgct 
gagggtcctt tggctttcaa tattctcaag 

ccttttattc accaatgagc attggaatga 
ggttttcgaa atacatccgg atgtcgaagg 
acgtggcact attgatcatc cgatagctct 
gttgctagca gttccaggtg gaatgttatg 
catgatctct agttagctca ccacaaaagt 
aagctgtaag gatttcggca cggctacgga 
gtaccattta attctatttg tgtttgatcg 
aaagtcgtat agctaccagt gaggaagtgg 
ataaacttta agcctaaact atacagaata 
aaatctgtcc agatcatggt tgaccggtgc 
gttgacctag ctgattctgg agtgacccag 
cctccaccat ttgtagaaaa atgtgacgaa 
ctttctggca tgcggagaga cggacggacg 
gccagacagc tctggcggct ctgaggtgca 
ccctccgccc cgaagtggaa aggctggtgt 
atcggagaat atggagcttc atcgaatcac 
ggggtgtata gccgtcggcg aaatagcatg 



aaatcaataa gaagaacggt tcgtcttttt GO 
tggcgggata ttctgtttgc agttggctga 12 0 
ctgaaatacg tcgagcctgc tccgcttgga 18 0 
accaacatgg agtacgataa gggccagttc 24 0 
gttggcatga tggccgtcat gcatctgtac 300 
tcgatcatcc gctgaaggcg ctttcgaatc 360 
cgactggtga cctccagcgt ccctttaagg 42 0 
ccaagaccga caaggcctcc ctccagaacg 48 0 
aggagtaagc tccttattga agtcggagga 54 0 
tctgtattat agataagatg atgaggaatt 60 0 
ttccaggctg agactctagc ttggagcata 650 
tatctcgagt ttgaacttat tccctgtgaa 720 
acatgaatct gaggactgca atcgccatga 78 0 
cttggggcac ctgcgttggt tgaatttaga 84 0 
gcaaagggcg ttgcacaatg caagtcaaac 900 
atgagcattg tattaaatca ggagatatag 96 0 
cagacggcgt aaccaaaagt cacacaacac 102 0 
agacggagaa gccaccttca gtggactcga 108 0 
agacctaata cagcccctac aacgaccatc 114 0 
actcaaatcg acttcagcaa catctcctgg 1200 
agataggtgg agagcttata ccgagctccc 12 6 0 
ctggatcttc ctatagaatc atccttattc 1320 
agggtcatga cttgagccta aaatccgccg 138 0 
ctcgtgagct ctgtacagtg accggtgact 1440 
cagagagaag ggctgagtaa taagccactg 1500 
gtggatgatt attaatccgg gaccggccgc 1560 
gcccctcgtt gaccaagaat ctattgcatc 1620 
cggcagtaag cgaaggagaa tgtgaagcca 1680 
ccattaacct aggtacagaa gtccaattgc 174 0 
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ttccgatctg gtaaaagatt cacgagatag taccttctcc gaagtaggta gagcgagtac laoo 
ccggcgcgta agctccctaa ttggcccatc cggcatctgt agggcgtcca aatatcgtgc 1860 
ctctcctgct ttgcccggtg tatgaaaccg gaaaggccgc tcaggagctg gccagcggcg 1920 
cagaccggga acacaagctg gcagtcgacc catccggtgc tctgcactcg acctgctgag 1980 
gtccctcagt ccctggtagg cagctttgcc ccgtctgtcc gcccggtgtg tcggcggggt 2040 

tgacaaggtc gttgcgtcag tccaacattt gttgccatat tttcctgctc tccccaccag 2100 

ctgctctttt cttttctctt tcttttccca tcttcagtat attcatcttc ccatccaaga 2160 

acctttattt cccctaagta agtactttgc tacatccata ctccatcctt cccatccctt 2220 

attcctttga acctttcagt tcgagctttc ccacttcatc gcagcttgac taacagctac 228 0 

cccgcttgag cagacatcac catggatcca cttaacgtta ctgaaatcat caaacagctt 234 0 

gacgaatctg gatataagat cgttggtgtc gatgtcagct ccggagttga gacaaatggt 2400 

gttcaggatc tcgataagat acgttcattt gtccaagcag caaagagtgc cttctagtga 2460 
tttaatagct ccatgtcaac aagaataaaa cgcgttttcg ggtttacctc ttccagatac 2520 
agctcatctg caatgcatta atgcattgac tgcaacctag taacgccttn caggctccgg 2580 
cgaagagaag aatagcttag cagagctatt ttcattttcg ggagacgaga fccaagcagat 264 0 
caacggtcgt caagagacct acgagactga ggaatccgct cttggctcca cgcgactata 2700 
tatttgtctc taattgtact ttgacatgct cctcttcttt actctgatag cttgactatg 2760 
aaaattccgt caccagcncc tgggttcgca aagataattg catgtttctt ccttgaactc 2820 
tcaagcctac aggacacaca ttcatcgtag gtataaacct cgaaatcant tcctactaag 2880 
atggtataca atagtaacca tgcatggttg cctagtgaat gctccgtaac acccaatacg 2940 
ccggccgaaa cttttttaca actctcctat gagtcgttta cccagaatgc acaggtacac 3 000 
ttgtttagag gtaatccttc tttctagaag tcctcgtgta ctgtgtaagc gcccactcca 3060 
catctccact cgacctgcag gcatgcaagc ttggcactgg ccgtcgtttt acaacgtcgt 3120 
gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc 318 0 
agctggcgta atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg 324 0 
aatggcgaat ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac 3300 
cgcatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga 3360 
cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 3420 
agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 3 48 0 
aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata 3540 
ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 36 00 
tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 3 66 0 
atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt 3720 
attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa 3780 
gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac 3840 
agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt 3 900 
aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt 3960 
cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat 4020 
cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac 408 0 
actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg 4140 
cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc 4200 

ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa 4260 
ctattaactg gcgaactact tactctagct tcccggcaac aattaataga ctggatggag 4 32 0 
gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct 4380 
gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat 4440 
ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa 4500 
cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac 4560 
caagtttact catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc 4620 
taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc 4680 
cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc ttcttttctg 474 0 
cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg 4800 
gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca 486 0 
aatactgtcc ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg 4 92 0 
cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg 4980 
tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga 504 0 
acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac 5100 
ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat 5160 
ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacacc 5220 
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cggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga 528 0 
tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc 534 0 
ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg 54 0 0 
gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag 546 0 
cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa cacgcaaacc gcctctcccc 552 0 
gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc 558 0 

^gtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac 564 0 
tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga 5700 
aacagctatg accatgatta cgaattgcgg ccg 573 3 



<210> 13 
<211> 39 
<212> DNA 

<213> Artificial Sequence 



<£220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 13 

gtcaaccgcg gactgcgcat catgaagttc ttcgccatc 39 



<210> 14 
<2ll> 66 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 14 

tctagcaagc ttggctctag ttctggaacc gcaccaggcg gcagcaacgg caacggcaat 60 
9tttgc S5 



<210> 15 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 15 

tcgtacaagc tttcaagcac cgacggcggt 



<210> 16 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 16 

tctagctcta gaagcaacgg caacggcaat gtt 
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<210> 17 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 17 

tgctagtcga cctgctagca gcaccgacgg cggtctg 37 

<210> 18 

<211> 4614 

<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<223> 5. cerevisiae FLOl coding sequence 

<400> 18 

atgacaatgc ctcatcgcta tatgtttttg gcagtcttta cacttctggc actaactagt 60 
gtggcctcag gagccacaga ggcgtgctta ccagcaggcc agaggaaaag tgggatgaat 12 0 
ataaattttt accagtattc attgaaagat tcctccacat attcgaatgc agcatatatg 18 0 
gcttatggat atgcctcaaa aaccaaacta ggttctgtcg gaggacaaac tgatatctcg 240 
attgattata atattccctg tgttagttca tcaggcacat ttccttgtcc tcaagaagat 3 00 
tcctatggaa actggggatg caaaggaatg ggtgcttgtt ctaatagtca aggaattgca 350 
tactggagta ctgatttatt tggcttctat actaccccaa caaacgtaac cctagaaatg 420 
acaggttatt ttttaccacc acagacgggt tcttacacat tcaagtttgc tacagttgac 480 
gactctgcaa ttctatcagt aggtggtgca accgcgttca actgttgtgc tcaacagcaa 540 
ccgccgatca catcaacgaa ctttaccatt gacggtatca agccatgggg tggaagtttg 600 
ccacctaata tcgaaggaac cgtctatatg tacgctggct actattatcc aatgaaggtt 660 
gtttactcga acgctgtttc ttggggtaca cttccaatta gtgtgacact tccagatggt 72 0 
accactgtaa gtgatgactt cgaagggtac gtctattcct ttgacgatga cctaagtcaa 780 
tctaactgta ctgtccctga cccttcaaat tatgctgtca gtaccactac aactacaacg 840 
gaaccatgga ccggtacttt cacttctaca tctactgaaa tgaccaccgt caccggtacc 900 
sacggcgttc caactgacga aaccgtcatt gccatcagaa ctccaacaac tgctagcacc 960 
atcataacta caactgagcc atggaacagc acttttacct ctacttctac cgaattgacc 1020 
acagtcactg gcaccaatgg tgtacgaact gacgaaacca tcattgtaat cagaacacca 108 0 
acaacagcca ctactgccat aactacaact gagccatgga acagcacttt tacctctact 1140 
tctaccgaat tgaccacagt caccggtacc aatggtttgc caactgatga gaccatcatt 1200 

gtcatcagaa caccaacaac agccactact gccatgacta caactcagcc atggaacgac 1260 
acttttacct ctacatccac tgaaatgacc accgtcaccg gtaccaacgg tttgccaact 1320 
gatgaaacca tcattgtcat cagaacacca acaacagcca ctactgctat gactacaact 1380 
cagccatgga acgacacttt tacctctaca tccactgaaa tgaccaccgt caccggtacc 1440 
aacggtttgc caactgatga aaccatcatt gtcatcagaa caccaacaac agccactact 1500 
gccatgacta caactcagcc atggaacgac acttttacct ctacatccac tgaaatgacc 1560 
accgtcaccg gtaccaatgg tttgccaact gatgagacca tcattgtcat cagaacacca 162 0 
acaacagcca ctactgccat gactacaact cagccatgga acgacacttt tacctctaca 1680 
tccactgaaa tgaccaccgt caccggtacc aacggtttgc caactgatga aaccatcatt 174 0 
gtcatcagaa caccaacaac agccactact gccataacta caactgagcc atggaacagc 1800 
acttttacct ctacttctac cgaattgacc acagtcaccg gtaccaatgg tttgccaact 1860 
gatgagacca tcattgtcat cagaacacca acaacagcca ctactgccat gactacaact 192 0 
cagccatgga acgacacttt tacctctaca tccactgaaa tgaccaccgt caccggtacc 1980 
ciacggtttgc caactgatga aaccatcatt gtcatcagaa caccaacaac agccactact 204 0 
gccatgacta caactcagcc atggaacgac acttttacct ctacatccac tgaaatgacc 2100 
accgtcaccg gtaccaacgg tttgccaact gatgagacca tcattgtcat cagaacacca 2160 
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acaacagcca ctactgccat gactiacaacc 
tccactgaaa tgaccaccgt caccggtacc 
gtcatcagaa ctccaactag tgaaggtcta 
actttcacct ctacatccac cgagatgacc 
gacgaaaccg tgattgttat cagaactcca 
gaaccatgga ctggtacttt tacttctaca 
aacggcgttc caactgacga aaccgtcatt 
atcagcacca ccactgaacc atggactggt 
accattactg gaaccaatgg tcaaccaact 
actagtgaag gtctaatcag caccaccact 
tctactgaaa tgaccaccgt caccggtacc 
gtcatcagaa ctccaaccag tgaaggtcta 
actttcactt cgacttccac tgaggttacc 
gacgaaactg tgattgttat cagaactcca 
gaaccatgga ctggtacttt cacttctaca 
aacggtcaac caactgacga aaccgtgatt 
gttacaacca ccactgaacc atggactggt 
actgtcactg gaaccaatgg cttgccaact 
actactgcca tctcatccag ttcgtcatca 
acgtcttcgc gtccaattat taccccattc 
tcctcagtaa tttcttcctc agtcacttct 
tcctcagtca tttcttcttc tacaacaacc 
tcatccgtca ttccaaccag tagttccacc 
gctggttctg tctcttcttc ctcttttatc 
tcttcttcat cattaccact tgttaccagt 
ttaccacctg ctaccactac aaaaacgagc 
tgcgagtctc atgtgtgcac tgaatccatc 
actgttagcg gcgtcacaac agagtatacc 
acaaagcaaa ccaaagggac aacagagcaa 
gttacaattt cttcttgtga atctgacgta 
tctacaagca ctgctactat taacggcgtt 
tccaccacag aatcgaggca acaaacaacg 
gtgtgttccg aaactgcttc acctgccatt 
gttgttacgg tctatcctac atggaggcca 
aaaatgaaca gtgctaccgg tgagacaaca 
aatactgtag ctgctgagac gattaccaat 
acctcttcgc tttcaagatc taatcacgct 
attggtcaca gcagtagtgt tgtttctgta 
agttccgggt tgagtactat gtcgcaacag 
ggatatagta cagcttcttt agaaatttca 
gccggtagtg gtttaagtgt cttcattgcg 



cagccatgga acgacacttt tacctctaca 2220 

aacggcgttc caactgacga aaccgtcatt 22&0 

atcagcacca ccactgaacc atggactggt 234 0 

accgtcaccg gtactaacgg tcaaccaact 2400 

accagtgaag gtttggttac aaccaccact 2460 

tctactgaaa tgaccaccat tactggaacc 252 0 

gtcatcagaa ctccaaccag tgaaggtcta 2580 

acttttactt ctacatctac tgaaatgacc 2640 

gacgaaaccg ttattgttat cagaactcca 27 00 

gaaccatgga ctggtacttt cacttctaca 2760 

aacggcgttc caactgacga aaccgtcatt 2820 

atcagcacca ccactgaacc atggactggc 2880 

accatcactg gaaccaacgg tcaaccaact 2940 

accagtgaag gtctaatcag caccaccact 3000 

tctgctgaaa tgaccaccgt caccggtact 3 060 

gttatcagaa ctccaaccag tgaaggtttg 3120 

acttttactt cgacttccac tgaaatgtct 318 0 

gatgaaactg tcattgttgt caaaactcca 324 0 

tcatcttcag gacaaatcac cagctctatc 3300 

tatcctagca atggaacttc tgtgatttct 336 0 

tctctattca cttcttctcc agtcatttct 3420 

tccacttcta tattttctga atcatctaaa 3480 

tctggttctt ctgagagcga aacgagttca 354 0 

tcttctgaat catcaaaatc tcctacatat 3600 

gcgacaacaa gccaggaaac tgcttcttca 3660 
gaacaaacca ctttggttac cgtgacatcc 3 72 0 

tcccctgcga ttgtttccac agctactgtt 3780 

acatggtgcc ctatttctac tacagagaca 384 0 
accacagaaa caacaaaaca aaccacggta 3 900 
tgctctaaga ctgcttctcc agccattgta 3 96 0 
actacagaat acacaacatg gtgtcctatt 4020 
ctagttactg ttacttcctg cgaatctggt 4 08 0 
gtttcgacgg ccacggctac tgtgaatgat 414 0 
cagactgcga atgaagagtc tgtcagctct 42 00 
accaatactt tagctgctga aacgactacc 4260 
actggagctg ctgagacgaa aacagtagtc 4320 
gaaacacaga cggcttccgc gaccgatgtg 4 380 
tccgaaactg gcaacaccaa gagtctaaca 4440 
cctcgtagca caccagcaag cagcatggta 4500 
acgtatgctg gcagtgccaa cagcttactg 4560 
tccttattgc tggcaattat ttaa 4614 



<210> 19 
<211> 5857 
<212> DNA 

<213> Saccharornyces cerevisiae and B. colx 
<220> 

<221> promoter 
<:222> (1) . . (452) 

<223> S. cerevzsiae GALl promoter 
<220> 

<223> (476-495) E. coli T7 promoter /priming site 
<220> 

<223> (502-501) E. coli multiple cloning site 
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<22Q> 

<223> (609-857) S, cerevislae CYCl transcription terminator 
<22D> 

<223> U039-1712) £. coli pMBl (pUC-derived; origin 
<220> 

<22l> gene 

<222> (1857) . . (2717) 

<223> E, coll ampicillin resistance gene 
<220> 

<221> gene 

<222> (2735) , . (3842) 

<22 3> S. cerevislae tJRA3 gene 

<220> 

<223> (3846-5317) S, cerevisiae 2 micron origin 
<220> 

<223> (5385-5840) E, coli fl origin 
<220> 

<223> (1-5857) Sequence of pYES2 
<400> 19 

acggattaga agccgccgag cgggtgacag ccctccgaag gaagactctc ctccgtgcgt 60 
cctcgtcctc accggtcgcg ttcctgaaac gcagatgtgc ctcgcgccgc actgctccga 12 o 
acaataaaga ttctacaata ctagctttta tggttatgaa gaggaaaaat tggcagtaac 180 
ctggccccac aaaccttcaa atgaacgaat caaattaaca accataggat gataatgcga 24 0 
ttagtttttt agccttattt ctggggtaat taatcagcga agcgatgatt tttgatctat 3 00 
taacagatat ataaatgcaa aaactgcatt aaccacttta actaatactt tcaacatttt 36 0 
cggtttgtat tacttcttat tcaaatgtaa taaaagtatc aacaaaaaat tgttaatata 420 
cctctatact ttaacgtcaa ggagaaaaaa ccccggatcg gactactagc agctgtaata 480 
cgactcacta tagggaatat taagcttggt accgagctcg gatccactag taacggccgc 54 0 
cagtgtgctg gaattctgca gatatccatc acactggcgg ccgctcgagc atgcatctag ^00 

sgggccgcat catgtaatta gttatgtcac gcttacattc acgccctccc cccacatccg 66 0 

ctctaaccga aaaggaagga gttagacaac ctgaagtcta ggtccctatt tattttttta 72 0 
tagttatgtt agtattaaga acgttattta tatttcaaat ttttcttttt tttctgtaca 780 
gacgcgtgta cgcatgtaac attatactga aaaccttgct tgagaaggtt ttgggacgct 840 
cgaaggcttt aatttgcggc cctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 900 
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 96 0 
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 102 0 
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa agcccaggaa ccgtaaaaag 1080 
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 114 0 
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 1200 
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 1260 
tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 132 0 
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 13 8 0 
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 1440 
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 1500 
ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 1560 
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 162 0 
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 168 0 
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 1740 
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 1800 
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 1860 
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 1920 



wo 00/58342 



12 



PCT/FIOO/00249 



gcctgactcc ccgccgtgta gataactacg 

gctgcaatga taccgcgaga cccacgctca 

ccagccggaa gggccgagcg cagaagtggt 

attaattgtt gccgggaagc tagagtaagt 

gttggcattg ctacaggcat cgtggtgtca 

tccggttccc aacgatcaag gcgagttaca 
agctccttcg gtcctccgat cgttgtcaga 

gttatggcag cactgcataa ttctcttact 

actggtgagt actcaaccaa gtcattctga 

tgcccggcgt caatacggga taatagtgta 

attggaaaac gttcttcggg gcgaaaactc 

tcgatgtaac ccactcgtgc acccaactga 

^ctgggtgag caaaaacagg aaggcaaaat 

aaatgttgaa tactcatact cttccttttt 
aagctctaat ttgtgagttt agtatacatg 
gctggccgca tcttctcaaa tatgcttccc 
ccttagcatc ccttcccttt gcaaatagtc 
tagagaccac atcatccacg gttctatact 
aacccacacc gggtgtcata atcaaccaat 
tttgagcaat aaagccgata acaaaatctt 
fcagtatattc tccagtagat agggagccct 
ctctaggttc ctttgttact tcttctgccg 

ccaccacacc gtgtgcattc gtaatgtctg 
agtactgcaa tttgactgta ttaccaatgt 
aattgtactt ggcggataat gcctttagcg 
tcaagatatc cacatgtgtt tttagtaaac 
ccagtaattc cttggtggta cgaacatcca 
gcatgatatt aaatagcttg gcagcaacag 
atgtagcttt cgacatgatt tatcttcgtt 
ttaagaatac tgggcaattt catgtttctt 
taagtctgtg ctccttcctt cgttcttcct 
ttcaaagaaa ccgaaatcaa aaaaaagaat 
ctagcttatc gatgataagc tgtcaaagat 
ctagatactc cgtctactgt acgatacact 
cttaccactc ttttgttact ctattgatcc 
ctatcttcgc gatgtagtaa aactagctag 
acttctacaa tggctgccat cattattatc 
tcgaatacgc tttgaggaga tacagcctaa 
tcgtacctgt tacccatcat tgaattttga 
agatagtata tttgaacctg tataataata 
atgtatttcg gttcctggag aaactattgc 

atccccggtt cattttctgc gtttccatct 
agcatctgtg cttcattttg tagaacaaaa 
aaagaatctg agctgcattt ttacagaaca 
acgaagaatc tgtgcttcat ttttgtaaaa 
tttcaaacaa agaatctgag ctgcattttt 
tttcaccaac aaagaatcta tacttctttt 
tatttttcta acaaagcatc ttagattact 
gtctcttgat aactttttgc actgtaggtc 
tctattttct cttccataaa aaaagcctga 
gaagctgcgg gtgcattttt fccaagataaa 
ggattgcgca tactttgtga acagaaagtg 
aattatgaac ggtttcttct attttgtctc 
ttcgtattgt tttcgattca ctctatgaat 
gtaatactag agataaacat aaaaaatgta 

cgaaaggtgg atgggtaggt tatataggga 
cttttgagca atgtttgtgg aagcggtatt 
tcagaaaagc cccaaaaaca ggaagattgt 



atacgggagc gcttaccatc tggccccagt 198 0 
ccggctccag atttatcagc aataaaccag 2 04 0 

cctgcaactt tatccgcctc cattcagtct 2100 

agttcgccag ttaatagttt gcgcaacgtt 2160 

ctctcgtcgt ttggtatggc ttcattcagc 222C 

tgatccccca tgttgtgcaa aaaagcggtt 2280 

agtaagttgg ccgcagtgtt atcactcatg 2340 

gtcatgccat ccgtaagatg cttttctgtg 24 00 

gaatagtgta tgcggcgacc gagttgctct 246 0 

tcacatagca gaactttaaa agtgctcatc 2520 
tcaaggatct taccgctgtt gagatccagt 258 0 
tcttcagcat cttttacttt caccagcgtt 2640 
gccgcaaaaa agggaatiaag ggcgacacgg 2700 
caatgggtaa taactgatat aattaaattg 2760 
catttactta taatacagtt ttttagtttt 2820 
agcctgcttt tctgtaacgt tcaccctcca 2680 
ctcttccaac aataataatg tcagatcctg 2940 
gttgacccaa tgcgtctccc ttgtcatcta 3000 
cgtaaccttc atctcttcca cccatgtctc 3 06 0 
tgtcgctctt cgcaatgtca acagtaccct 312 0 
tgcatgacaa ttctgctaac atcaaaaggc 3180 
cctgcttcaa accgctaaca atacctgggc 324 0 
cccattctgc tattctgtat acacccgcag 3 3 00 
cagcaaattt tctgtcttcg aagagtaaaa 3360 
gcttaactgt gccctccatg gaaaaatcag 3420 
aaattttggg acctaatgct tcaactaact 3480 
atgaagcaca caagtttgtt tgcttttcgt 3540 
gactaggatg agtagcagca cgttccttat 3600 
tcctgcaggt ttttgttctg tgcagttggg 3660 
caacactaca tatgcgtata tataccaatc 3720 
tctgttcgga gattaccgaa tcaaaaaaat 378 0 
aaaaaaaaaa tgatgaaccg aattgaaaag 3840 

gagaattaat tccacggact atagactata 3900 

tccgctcagg tccttgtcct ttaacgaggc 3 960 
agctcagcaa aggcagtgtg atctaagatt 4020 
accgagaaag agactagaaa tgcaaaaggc 4 08 0 
cgatgtgacg ctgcagcttc tcaatgatat 414 0 
tatccgacaa actgttttac agatttacga 4200 
acatccgaac ctgggagttt tccctgaaac 4260 
tatagtctag cgctttacgg aagacaacgt 4320 
atctattgca taggtaatct tgcacgtcgc 4380 
tgcacttcaa tagcatatct ttgttaacga 444 0 
atgcaacgcg agagcgctaa tttttcaaac 4 500 
gaaatgcaac gcgaaagcgc tattttacca 4 56 0 
caaaaatgca acgcgacgag agcgctaatt 462 0 
acagaacaga aatgcaacgc gagagcgcta 4 68 0 
ttgttctaca aaaatgcatc ccgagagcgc 4 74 0 
ttttttctcc tttgtgcgct ctataatgca 48 00 
cgttaaggtt agaagaaggc tactttggtg 4860 
ctccacttcc cgcgtttact gattactagc 4920 
ggcatccccg attatattct ataccgatgt 4980 
atagcgttga tgattcttca ttggtcagaa 5040 
tatiatactac gtataggaaa tgtttacatt 5100 
agttcttact acaatttttt tgtctaaaga 5160 

gaggtcgagt ttagatgcaa gttcaaggag 5220 

tatagcacag agatatatag caaagagata 5280 
cgcaatggga agctccaccc cggttgataa 534 0 
ataagcaaat atttaaattg taaacgtt^a 54 00 



wo 00/58342 



13 



PCT/FIOO/00249 



tattttgtta aaattcgcgt taaatttttg ttaaatcagc tcatttttta acgaatagcc 5460 
cgaaatcggc aaaatccctt ataaatcaaa agaatagacc gagatagggt tgagtgttgt 5520 
tccagtttcc aacaagagtc cactattaaa gaacgtggac tccaacgtca aagggcgaaa 5580 
aagggtctat cagggcgatg gcccactacg tgaaccatca ccctaatcaa gttttttggg 564 0 
gtcgaggtgc cgtaaagcag taaatcggaa gggtaaacgg acgcccccat ttagagcttg 5700 
acggggaaag ccggcgaacg tggcgagaaa ggaagggaag aaagcgaaag gagcgggggc 5760 
tagggcggtg ggaagtgtag gggtcacgct gggcgcaacc accacacccg ccgcgcttaa 5820 
^9999cgcza. cagggcgcgt ggggatgatc cactagt 5857 



<210> 20 
<211> 403 
<212> DNA 

<213> Trichoderma reesei 



<220> 

<223> (1-403) T, reesei hf]o2 coding sequence 



<220> 

<221> intron 
<222> (131) . . (200) 

<220> 

<221> intron 
<222> (287) . . (358) 



<400> 20 

atgcagttct tcgccgtcgc cctcttcgcc 
ggcctcttct ccaaccctct gtgctgtgcc 
tgcaagaccc gtatgttgaa ttccaatctc 
acttacacga tgctttacag ctaccatcgc 
ctgtgccagc aagggctcca agcctctttg 
caatggcaaa gaagtaaaaa gacatttggg 
cgaccaggct ctcctgtgcc agaaggccat 



accagcgccc tggctgctgt ctgccctacc 60 
accaacgtcc tcgacctcat tggcgttgac 120 
tgggcatcct gacattggac gatacagttg 180 
cgtcgacact ggcgccatct tccaggctca 240 
ctgcgttgct cccgtggtaa gtagtgctcg 3 00 
cctgggatcg ctaactcttg atatcaaggc 360 
cggcaccttc taa 4 03 



<210> 21 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 
<400> 21 

cggaggagct cgacgacttc gagcagcccg agctgcacgc aggctgtctg ccctaccgg 59 



<2io> 22 

<211> 29 
<:212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 



<400> 22 

tcattggatc cttagaaggt gccgatggc 



29 
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<210> 23 
<211> 679 
<212> DNA 

<213> Schizophyllum commune 
<220> 

<223> (1-679) SC3 coding sequence 
<220> 

<223> (1-92) 1st CDNA 
<220> 

<223> (146-183) 2nd cDNA 
<220> 

<223> {240-317} 3rd cDNA 
<220> 

<223> (374-469) 4th cDNA 
<220> 

<223> (524-586) 5th cDNA 
<220> 

<223> (635-679) 6th cDNA 
<400> 23 

atgttcgccc gtctccccgt cgtgttcctc tacgccttcg tcgcgttcgg cgccctcgtc 60 
gctgccctcc caggtggcca cccgggcacg acgtacgtcg acctctcacc gtcctctaat 120 
gtcttgctga tgaagccccg tatagcacgc cgccggttac gacgacggtg acggtgacca 180 
cggtgagtag ctttctcgcc gtcgacgact cgaacgcatt ggctaatttt tgctcatagc 24 0 
cgccctcgac gacgaccatc gccgccggtg gcacgtgtac tacggggtcg ctctcttgct 300 
gcaaccaggt tcaatcggta cgtacatcaa agcggcacga ccaggcatct cagctgacgg 360 
ccacatcgta caggcgagca gcagccctgt caccgccctc ctcggcctgc tcggcattgt 42 0 
cctcagcgac ctcaacgttc tcgttggcat cagctgctct cccctcactg tgagatcttt 48 0 
ttgttcactg tcccaattac tgcgcactga cagactttgc caggtcatcg gtgtcggagg 540 
cagcggctgt tcggcgcaga ccgtctgctg cgaaaacacc caattcgtat gtatactttc 6 00 
catgcgtgtc cctttctccg ctaatcatct gtagaacggg ctgatcaaca tcggttgcac 660 
ccccatcaac atcctctga 679 



<210> 24 

<211> 63 
<212> DKfA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PCR 5' primer 



<400> 24 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcaggg tggccacccg 60 

ggc 63 
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<210> 25 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 25 

tcgtacggat ccccagagga tgttgatggg 3 0 



<210> 26 

<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description Of Artificial Sequence: PGR 5' primer 
<400> 26 

ggaattccgc ggactgcgca tcatgaagtt cttcgccatc gcc 43 



<2iO> 27 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 27 

tgaattccat atgttaggta ccaccggggc ccatgccggt agaagtagaa gccccgggag 60 
caccgacggc ggtctggcac 8 0 



<210> 28 
<211> 31 
<212> DNA 

<2135- Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 28 

tgaattcggt acccaggctt gctcaagcgt c 31 



<210> 29 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<:220> 

<223> Description of Artificial Sequence; PCR 3' primer 



i 
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<400> 29 

tgaattccat atgtcacagg cactgagagt agta 34 



<210> 30 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5* primer 
<400> 30 

gaattcggta ccctcgtccc tcgcggtccc gccgaagtga acctggtg 4 8 



<210> 31 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 31 

tgaattccat atgctaaccc cgtttcatct ccag 34 



<2io> 32 

<211> 918 
<212> DNA 

<213> TrichodermsL reesei 



<220> 

<221> terminator 
<222> {!) . , (918) 

<223> T. reesei hfbl terminator 



<400> 32 

gatgcccgcc cggggtcaag gtgtgcccgt gagaaagccc acaaagtgtt gatgaggacc 60 
atttccggta ctgggaaagt tggctccacg tgtttgggca ggtttgggca agttgtgtag 120 
atattccatt cgtacgccat tcttattctc caatatttca gtacactttt cttcataaat 180 
caaaaagact gctattctct ttgtgacatg ccggaaggga acaattgctc ttggtctctg 24 0 
ttatttgcaa gtaggagtgg gagattcgcc ttagagaaag tagagaagct gtgcttgacc 300 
gtg^tgtgac tcgacgagga tggactgaga gtgttaggat taggtcgaac gttgaagtgt 360 
atacaggatc gtctggcaac ccacggatcc tatgacttga tgcaatggtg aagatgaatg 42 0 
acagtgtaag aggaaaagga aatgtccgcc ttcagctgat atccacgcca atgatacagc 4 80 
gatatacctc caatatctgt gggaacgaga catgacatat ttgtgggaac aacttcaaac 540 
agcgagccaa gacctcaata tgcacatcca aagccaaaca ttggcaagac gagagacagt 6 00 
cacattgtcg tcgaaagatg gcatcgtacc caaatcatca gctctcatta tcgcctaaac 660 
cacagattgt ttgccgtccc ccaactccaa aacgttacta caaaagacat gggcgaatgc 720 
aaagacctga aagcaaaccc tttttgcgac ticaattccct cctttgtcct cggaatgatg 780 
atccttcacc aagtaaaaga aaaagaagat tgagataata catgaaaagc acaacggaaa 840 
cgaaagaacc aggaaaagaa taaatctatc acgcaccttg tccccacact aaaagcaaca 9 00 
g999gsrgtaa aatgaaat 918 
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<210> 33 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 
<40Q> 33 

gacctcgatg cccgcccggg gtcaag 26 



<210> 34 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 34 

gtcgacattt cattttaccc ccctcg 26 



<210> 35 
<211> 1190 
<212> DNA 

< 2 1 3 > Tri chodeirma. reesel 



<220> 

<221> promoter 

<222> <1) . . (1190) 

<223> T. reesei hfb2 promoter 



<400> 35 

ctcgagcagc tgaagcttgc atgcctgcat 
acactgccgt cgacgtctct cttccgacct 
cgctttgtat tattagagta tatgcaagtc 
aggacgatcg cggcacgata cgccctcgtt 
ggagcaagat tcagcccaag ggagatggac 
ccactccctt tggccaaatc agcttgcatg 
actaacaacc agggatggga aacgaagcct 
gagagaggga gagagacaat gagtgccaca 
aatgtcacat ccgagtcttg gggcctctgt 
aatagtcctc tttcttgagg actggatacc 
tttcgaagta ttatctggag gatagaagac 
ttggggagta gcccaacgaa aggtaactcc 
cagcctcttg gagccgtcat agatcacatc 
ctcatcatgg ccacatacta ctacatacgt 
caagacacca aggccccagc atcaagcctc 
gagagattgc gatgagccct ctccctacct 
agccaacgtg atgaaaacaa ggacatgagg 
ggccagaaaa aaaagtataa agaagacctc 
ctcatcagca cactcatcta caaccatcac 
tccaaacaca aacattctct gttgaatacc 



cctttgtgag cgactgcatc cattttgcac 60 
tggccagctg gacaagcaac acaccaatga 120 
tcaggactat cgactcaact ctacccaccg ISO 
ctcattggcc caagcagacc aactgcccct 24 0 
ggcagggcac gccaggcccc caccaccaag 300 
tcaagagaca tcgagctgtg ccttgaaatt 360 
gcttttggaa agacaacaat gagagagaga 420 
aacctggtag tgctccgcca atgcgtctga 48 0 
gagaatgtcc agagtaatac gtgttttgcg 54 0 
tacgataccc tttttgagtt gatgcggtgc 6 00 
gtctaggtaa ctacacaaaa ggcctatact 660 
tacggcctct tagagccgtc atagatccta 72 0 
tgtgtagacc gacattctat gaataatcat 780 
gtctctgcct acctgacatg tagcagtggc 84 0 
cctacctatc ccttccattg tacagcggca 900 
acagacggct gacaatgtcc gtataccacc 96 0 
aacagcctgc gagagctgga agatgaagag 1020 
gattcccgcc atccaacaat cttttccatc 1080 
cacattcact caactcctct ttctcaactc 1140 
aaccatcacc acctttcaag 1190 
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<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 
<400> 36 

aagcttgcat gcctgcatcc 20 



<210> 37 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 37 

ccatggtgaa aggtggtgat ggttgg 26 



<210> 38 
<211> 13 
<212i. PRT 

<213> Trichodeuna reesel 

<22Q> 

<223> vild type T. reesei EGI peptide linker 
<400> 38 

Val Pro Arg Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly 
15 10 



<210> 39 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: modified CBHII linker 
<400> 39 

Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly 
15 10 



<210> 40 
<211> 19 
<212> PRT 

<213> Artificial Sequence 



wo 00/58342 



19 



PCT/FIOO/00249 



<220> 

<223> Description of Artificial Sequence: Met /Thrombin 
linker 

<400> 40 

Pro Gly Arg Pro Val Leu Thr Gly Pro Gly Met Gly Thr Ser Thr Ser 
15 10 15 

Ala Gly Pro 



<210> 41 

<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Met -containing linker 
<400> 41 

Pro Gly Ala Ser Thr Ser Thr Gly Met Gly Pro Gly Gly 
15 10 



<210> 42 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker containing the thrombin 
cleavage site 

<400> 42 

Gly Thr Leu Val Pro Arg Gly Pro Ala Gly Val Asn Leu Val 
15 10 



